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CONTENTS OVERVIEW 


Historically, Integrated Device Technology has presented our product offerings entirely under 
one cover. For ease of use for our customers, we have divided the products into fourseparate data 
books — Logic, Specialized Memory, RISC and Static RAM. 

IDT’s 1991 RISC Data Book is comprised of new and revised data sheets and application notes 
for the RISC and RISC Subsystem product lines. Also included is a current, complete packaging 
section for all IDT product groups. This section will be updated in each subsequent data book with 
the latest available packages. 

The RISC Data Book’s T able of Contents contains a listing of the products contained in the 1 991 
RISC Data Book, as well as those products which are contained in the other three data books. The 
numbering scheme is slightly different from the past. The number in the bottom center of the page 
denotes the section number and the sequence of the data sheet within that section, (i.e. 5.5 would 
be the fifth data sheet in the fifth section). The number in the lower right hand corner is the page 
number of that particular data sheet. 

Integrated Device Technology, a recognized leaderin high-speed CMOS technology, produces 
a broad line of products, enabling us to provide a complete CMOS solution to designers of high- 
performance digital systems. Our products include industry standard devices, as well as products 
with speed, lower power, package and/or architectural benefits that allow the designer to achieve 
significantly improved system performance. 

Use this book to find ordering information: Start with the Ordering Information chart at the 
back of each data sheet or the Cross Reference Guides (in Section 1), along with the Package 
Outline Index (page 4.2), to compose the complete IDT part number. Reference data on our 
Technology Capabilities and Quality Commitments are included in separate sections (2 and 3, 
respectively). 

Use this book to find product data: Start with the Table of Contents, organized by product 
line (page 1.3), or with the Numeric Table of Contents across all product lines (page 1 .4). These 
indexes will direct you to the page on which the complete technical data sheet can be found. Data 
sheets may be of the following type: 

ADVANCE INFORMATION — contain initial descriptions, subject to change, for products that 
are in development, including features and block diagrams. 

PRELIMINARY — contain descriptions for products soon to be, or recently, released to 
production, including features, pinouts and block diagrams. Timing data are based on simulation 
or initial characterization and are subject to change upon full characterization. 

FINAL — contain minimum and maximum limits specified over the complete supply and 
temperature range for full production devices. 

New products, product performance enhancements, additional package types and new product 
families are being introduced frequently. Please contact your local IDT sales representative to 
determine the latest device specifications, package types and product availability. 


LIFE SUPPORT POLICY 

Integrated Device Technology’s products are not authorized for use as critical components in life support devices 
or systems unless a specific written agreement pertaining to such Intended use is executed between the manufac- 
turer and an officer of IDT. 

1 . Life support devices or systems are devices or systems which (a) are Intended for surgical implant into the body 
or (b) support or sustain life and whose failure to perform, when properly used in accordance with instructions for 
use provided in the labeling, can be reasonably expected to result in a significant injury to the user. 

2. A critical component is any component of a life support device or system whose failure to perform can be rea- 
sonably expected to cause the failure of the life support device or system, or to affect its safety or effectiveness. 


Note: Integrated Device Technology, Inc. reserves the right to make changes to its products or specifications at any time, without notice, 
in order to improve design or performance and to supply the best possible product. IDT does not assume any responsibility for use of any 
circuitry described other than the circuitry embodied in an IDT product. The Company makes no representations that circuitry described 
herein is free from patent infringement or other rights of third parties which may result from its use. No license is granted by implication or 
otherwise under any patent, patent rights or other rights, of Integrated Device Technology, Inc. 
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1990-91 LOGIC DATA BOOK 

The following Is a listing of the data sheets located in the 1990-91 Logic Data Book available under separate cover: 


COMPLEX LOGIC PRODUCTS PAGE 

DSP AND MICROSLICE™ PRODUCTS 

IDT39C01 4-Bit Microprocessor Slice 5.1 

IDT39C10 12-Bit Sequencer 5.2 

IDT49C402 16-Bit Microprocessor Slice 5.3 

IDT49C410 16-Bit Sequencer 5.4 

IDT7210L 16x16 Parallel Multiplier-Accumulator 5.5 

IDT7216L 16x16 Parallel Multiplier 5.6 

IDT7217L 16x16 Parallel Multiplier (32 Bit Output) 5.6 

IDT7381L 16-Bit CMOS Cascadable ALU 5.7 

IDT7383L 16-Bit CMOS Cascadable ALU 5.7 

READ/WRITE BUFFER PRODUCTS 

IDT73200L 16-Bit CMOS Multilevel Pipeline Register 5.8 

IDT73201L 16-Bit CMOS Multilevel Pipeline Register 5.8 

IDT73210 Fast Octal Register Transceiver w/Parity 5.9 

IDT73211 Fast Octal Register Transceiver w/Parity 5.9 

ERROR DETECTION AND CORRECTION PRODUCTS 

IDT39C60 16-Bit Cascadable EDC 5.10 

IDT49C460 32-Bit Cascadable EDC 5.11 

IDT49C465 32-Bit CMOS Flow-ThruEDC Unit 5.12 

IDT49C466 64-BIT CMOS Flow-ThruEDC Unit 5.13 

GRAPHICS PRODUCTS 

IDT75C457 CMOS Single 8-Bit PaletteDAC™ for True Color Applications 5.14 

IDT75C458 Triple 8-Bit PaletteDAC™ 5.15 

IDT75C48 8-Bit Flash ADC 5.16 

IDT75C58 8-Bit Flash ADC with Overflow Output 5.17 

STANDARD LOGIC PRODUCTS 

IDT29FCT52T Non-inverting Octal Registered Transceiver 6.1 

IDT29FCT53T Inverting Octal Registered Transceiver 6.1 

IDT29FCT520T Multi-level Pipeline Register 6.2 

IDT29FCT521T Multi-level Pipeline Register 6.2 

IDT54/74FCT1 38T 1-of-8 Decoder 6.3 

IDT54/74FCT 1 39T Dual l-of-4 Decoder 6.4 

IDT54/74FCT151T 8-Input Multiplexer 6.5 

IDT54/74FCT251T 8-Input Multiplexer w/3-State 6.5 

IDT54/74FCT157T Quad 2-Input Multiplexer 6.6 

IDT54/74FCT257T FQuad 2-Input Multiplexer w/3-State 6.6 

IDT54/74FCT161T Synchronous Binary Counter w/Asynchronous Master Reset 6.7 

IDT54/74FCT163T Synchronous Binary Counter w/Synchronous Reset 6.7 

IDT54/74FCT191T Up/Down Binary Counter w/Preset and Ripple Clock 6.8 

IDT54/74FCT193T Up/Down Binary Counter w/Separate Up/Down Clocks 6.9 

IDT54/74FCT240T Inverting Octal Buffer/Line Driver 6.10 

IDT54/74FCT241T Non-inverting Octal Buffer/Line Driver 6.10 

IDT54/74FCT244T Non-inverting Octal Buffer/Line Driver 6.10 

IDT54/74FCT540T Inverting Octal Buffer/Line Driver 6.10 

IDT54/74FCT541T Non-inverting Octal Buffer/Line Driver 6.10 

IDT54/74FCT245T Non-inverting Octal Transceiver 6.11 



1990-91 LOGIC DATA BOOK (CONTINUED) PAGE 

STANDARD LOGIC PRODUCTS (CONTINUED) 

IDT54/74FCT640T Inverting Octal Transceiver 6.11 

IDT54/74FCT645T Non-inverting Octal Transceiver 6.11 

IDT54/74FCT273T Octal D Flip-Flop w/Common Master Reset 6.12 

IDT54/74FCT299T 8 Input Universal Shift Register w/Common Parallel I/O Pins 6.13 

IDT54/74FCT373T Non-inverting Octal Transparent Latch w/3-State 6.14 

IDT54/74FCT533T Inverting Octal Transparent Latch w/3-State 6.14 

IDT54/74FCT573T Non-inverting Octal Transparent Latch w/3-State 6.14 

IDT54/74FCT374T Non-inverting Octal D Register 6.15 

IDT54/74FCT534T Inverting Octal D Register 6.15 

IDT54/74FCT574T Non-inverting Octal D Register 6.15 

IDT54/74FCT377T Octal D Flip-Flop w/Clock Enable 6.1 6 

IDT54/74FCT399T Quad Dual-Port Register 6.17 

IDT54/74FCT521T 8-Bit Identity Comparator 6.18 

I DT54/74FCT543T Non-inverting Octal Latched Transceiver 6.19 

IDT54/74FCT646T Non-inverting Octal Registered Transceiver 6.20 

IDT54/74FCT648T Inverting Octal Registered Transceiver 6.20 

IDT54/74FCT651T Inverting Octal Registered Transceiver 6.20 

IDT54/74FCT652T Non-inverting Octal Registered Transceiver 6.20 

IDT54/74FCT620T Inverting Octal Bus Transceiver w/3-State 6.21 

IDT54/74FCT623T Non-inverting Octal Bus Transceiver w/3-State 6.21 

IDT54/74FCT621T Non-inverting Octal Bus Transceiver (Open Drain) 6.22 

IDT54/74FCT622T Inverting Octal Bus Transceiver (Open Drain) 6.22 

IDT54/74FCT821T 10-Bit Non-inverting Register w/3-State 6.23 

IDT54/74FCT823T 9-Bit Non-inverting Register w/Clear & 3-State 6.23 

I DT54/7 4 F CT8 25T 8-Bit Non-inverting Register w/Clear & 3-State 6.23 

IDT54/74FCT827T 10-Bit Non-inverting Buffer 6.24 

IDT54/74FCT828T 10-Bit Inverting Buffer 6.24 

IDT54/74FCT841T 10-Bit Non-inverting Latch 6.25 

IDT54/74FCT843T 9-Bit Non-inverting Latch 6.25 

IDT54/74FCT845T 8-Bit Non-inverting Latch 6.25 

IDT29FCT52 Non-inverting Octal Registered Transceiver 6.26 

IDT29FCT53 Inverting Octal Registered Transceiver 6.26 

IDT29FCT520 Multi-level Pipeline Register 6.27 

IDT49FCT661 16-Bit Synchronous Binary Counter 6.28 

IDT49FCT804 High-Speed Tri-Port Bus Multiplexer 6.29 

IDT49FCT805 Buffer/Clock Driver w/Guaranteed Skew 6.30 

IDT49FCT806 Buffer/Clock Driver w/Guaranteed Skew ... 6.30 

IDT49FCT818 Octal Register with SPC™ 6.31 

IDT49C25 Microcycle Length Controller 6.32 

IDT39C8XX IDT39C8XXX Family 6.33 

IDT54/74FCT138 1-of-8 Decoder 6.34 

IDT54/74FCT1 39 Dual 1 -of-4 Decoder 6.35 

IDT54/74FCT1 61 Synchronous Binary Counter w/Asynchronous Master Reset 6.36 

IDT54/74FCT163 Synchronous Binary Counter w/Synchronous Reset 6.36 

IDT54/74FCT182 Carry Lookahead Generator 6.37 

IDT54/74FCT191 Up/Down Binary Counter w/Preset and Ripple Clocks 6.38 

IDT54/74FCT193 Up/Down Binary Counter w/Separate Up/Down Clocks 6.39 

IDT54/74FCT240 Inverting Octal Buffer/Line Driver 6.40 

IDT54/74FCT241 Non-inverting Octal Buffer/Line Driver 6.40 

IDT54/74FCT244 Non-inverting Octal Buffer/Line Driver 6.40 

IDT54/74FCT540 Inverting Octal Buffer/Line Driver 6.40 

IDT54/74FCT541 Non-inverting Octal Buffer/Line Driver 6.40 

IDT54/74FCT245 Non-inverting Octal Transceiver 6.41 

IDT54/74FCT640 Inverting Octal Transceiver 6.41 
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STANDARD LOGIC 

IDT54/74FCT645 
IDT54/74FCT273 
IDT54/74FCT299 
IDT54/74FCT373 
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IDT54/74FCT821 
IDT54/74FCT823 
I DT54/74FCT824 
IDT54/74FCT825 
IDT54/74FCT827 
IDT54/74FCT833 
IDT54/74FCT841 
I DT54/74FCT843 
IDT54/74FCT844 
I DT54/74FCT845 
IDT54/74FCT861 
IDT54/74FCT863 
IDT54/74FCT864 


PRODUCTS (CONTINUED) 

Non-inverting Octal Transceiver 

Octal D Flip-Flop w/Common Master Reset 

8-Input Universal Shift Register w/Common Parallel I/O Pins. 

Non-inverting Octal Transparent Latch 

Inverting Octal Transparent Latch 

Non-inverting Octal Transparent Latch 

Non-inverting Octal D Flip-Flop 

Inverting Octal D Flip-Flop w/3-State 

Non-inverting Octal D Register w/3-State... 

Octal D Flip-Flop w/Clock Enable 

Quad Dual-Port Register ; 

8- Bit Identity Comparator 

Non-inverting Octal Latched Transceiver : 

Non-inverting Octal Registered Transceiver 

10-Bit Non-inverting Register w/3-State 

9- Bit Non-inverting Register w/Clear & 3-State 

9- Bit Inverting Register w/Clear & 3-State 

8-Bit Non-inverting Register 

10- Bit Non-inverting Buffer 

8- Bit Transceiver w/Parity 

10-Bit Non-inverting Latch 

9- Bit Non-inverting Latch ..... 

9- Bit Inverting Latch 

8- Bit Non-inverting Latch 

10- Bit Non-inverting Transceiver 

9- Bit Non-inverting Transceiver 

9-Bit Inverting Transceiver 
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6.42 

6.43 

6.44 
6.44 

6.44 

6.45 
6.45 

6.45 

6.46 

6.47 

6.48 

6.49 

6.50 

6.51 
6.51 
6.51 

6.51 

6.52 

6.53 

6.54 
6.54 
6.54 

6.54 

6.55 
6.55 
6.55 



IDT54/74FBT240 Inverting Octal Buffer/Line Driver 6.56 

IDT54/74FBT241 Non-inverting Octal Buffer/Line Driver 6.57 

IDT54/74FBT244 Non-inverting Octal Buffer/Line Driver 6.58 

IDT54/74FBT245 Non-inverting Octal Transceiver 6.59 

IDT54/74FBT373 Octal Transparent Latch w/3-State 6.60 

IDT54/74FBT374 Non-inverting Octal D Register 6.61 

IDT54/74FBT540 Inverting Octal Buffer 6.62 

IDT54/74FBT541 Non-inverting Octal Buffer 6.62 

IDT54/74FBT821 10-Bit Non-inverting Register 6.63 

IDT54/74FBT823 9-Bit Inverting Register 6.64 

IDT54/74FBT827 Non-inverting 10-Bit Buffers/Driver 6.65 

IDT54/74FBT828 InvertinglO- Bit Buffers/Driver 6.65 

IDT54/74FBT841 10-Bit Non-inverting Latch 6.66 

IDT54/74FBT2240 Inverting Octal Buffer/Line Driver w/25Q Series Resistor 6.67 

IDT54/74FBT2244 Inverting Octal Buffer/Line Driver w/25Q Series Resistor 6.68 

IDT54/74FBT2373 Octal Transparent Latch w/3-State & 250 Series Resistor 6.69 

IDT54/74FBT2827 Non-inverting 10-Bit Buffers/Driver w/250 Series Resistor 6.70 

IDT54/74FBT2828 Invertingl 0-Bit Buffers/Driver w/250 Series Resistor 6.70 

IDT54/74FBT2841 10-Bit Memory Latch w/25Q Series Resistor 6.71 


APPLICATION AND TECHNICAL NOTES 

Complex Logic Products Technical Notes 

TN-02 Build a 20MIP Data Processing Unit 7.1 

TN-03 Using the IDT49C402A ALU 7.2 
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Complex Logic Products Application Notes 

AN-03 Trust Your Data with A High-Speed CMOS 6-, 32- or 64-Bit EDC 7.3 

AN-06 16-Bit CMOS Slices — New Building Blocks Maintain Microcode 

Compatibility Yet Increase Performance 7.4 

AN-17 FIR Filter Implementation Using FIFOs and MACs 7.5 

AN-24 Designing with the IDT49C460 and IDT39C60 Error Detection and 

Correction Units 7.6 

AN-32 Implementation of Digital Filters Using IDT7320, IDT7210, IDT7216 7.7 

AN-35 Address Generator in Matrix Unit Operation Engine 7.8 

AN-37 Designing High-Performance Systems Using the IDT PaletteDAC™ 7.9 

AN-63 Using the IDT75C457's PaletteDAC™ in True Color and Monochrome 

Graphics Applications 7.10 

AN-64 Protecting Your Data with IDT's 49C465 32-Bit Flow-thruEDC™ Unit 7.1 1 

AN-65 Using IDT73200 or IDT73210 as Read and Write Buffers with R3000 7.12 

Standard Logic Application Notes 7.13 

AN-47 Simultaneous Switching Noise 7.14 

AN-48 Using High-Speed Logic 7.15 

AN-49 Characteristics of PCB Traces 7.16 

AN-50 Series Termination 7.17 

AN-51 Power Dissipation in Clock Drivers 7.18 

AN-52 FCT Output Structures and Characteristics 7.19 

AN-53 Power-Down Operation 7.20 

AN-54 FCT-T Logic Family 7.21 

Standard Logic Technical Bulletins 7.22 
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SPECIALIZED MEMORIES DATA BOOK 

The following is a listing of the data sheets located in the 1990-91 Specialized Memories Data Book available 
under separate cover: 


ECL PRODUCTS PAGE 

I DTI 0484 4K x 4 ECL 1 0K SRAM (Corner Power) 5.1 

I DTI 00484 4Kx 4 ECL 100K SRAM (Corner Power) . 5.1 

IDT101484 4Kx 4 ECL 101K SRAM (Corner Power) 5.1 

IDT10A484 4Kx 4 ECL 10K SRAM (Center Power) 5.2 

IDT100A484 4K x 4 ECL 100K SRAM (Center Power) 5.2 

IDT101 A484 4Kx 4 ECL 101K SRAM (Center Power) 5.2 

I DTI 0490 64K x 1 ECL 10K SRAM 5.3 

I DTI 00490 64Kx1 ECL 100K SRAM 5.3 

IDT101490 64Kx1 ECL 101 K SRAM 5.3 

I DTI 0494 16K x 4 ECL 10K SRAM 5.4 

I DTI 00494 16K x 4 ECL 100K SRAM 5.4 

I DTI 01 494 16K x 4 ECL 101K SRAM 5.4 

IDT10496LL 16K x 4 Self-Timed Latch Input, Latch Output 5.5 

IDT100496LL 16K x 4 Self-Timed Latch Input, Latch Output 5.5 

IDT101496LL 16K x 4 Self -Timed Latch Input, Latch Output 5.5 

IDT10496RL 16K x 4 Self-Timed Reg Input, Latch Output 5.6 

IDT100496RL 16K x 4 Self-Timed Reg Input, Latch Output 5.6 

IDT101496RL 16K x 4 Self-Timed Reg Input, Latch Output 5.6 

IDT10497 16K x 4 Synchronous Write, Latch Output 5.7 

I DTI 00497 16K x 4 Synchronous Write, Latch Output 5.7 

IDT101497 16K x 4 Synchronous Write, Latch Output 5.7 

I DTI 0498 16K x 4 Conditional Write, Latch Output 5.8 

IDT100498 16K x 4 Conditional Write, Latch Output 5.8 

IDT101498 16K x 4 Conditional Write, Latch Output 5.8 

I DTI 0504 64K x 4 ECL 10K SRAM 5.9 

I DTI 00504 64K x 4 ECL 100K SRAM 5.9 

I DTI 01 504 64K x 4 ECL 100K SRAM 5.9 

IDT10506LL 64K x 4 Self-Timed Latch Input, Latch Output 5.10 

IDT100506LL 64K x 4 Self-Timed Latch Input, Latch Output 5.10 

IDT101506LL 64K x 4 Self-Timed Latch Input, Latch Output 5.10 

IDT10506RL 64K x 4 Self-Timed Reg Input, Latch Output 5.11 

IDT100506RL 64K x 4 Self-Timed Reg Input, Latch Output 5.11 

IDT101506RL 64K x 4 Self-Timed Reg Input, Latch Output... 5.11 

IDT10507 64K x 4 Synchronous Write, Latch Output 5.12 

IDT100507 16K x 4 Synchronous Write, Latch Output 5.12 

IDT101507 16K x 4 Synchronous Write, Latch Output 5.12 

IDT10508 64K x 4 Conditional Write, Latch Output 5.13 

I DTI 00508 64K x 4 Conditional Write, Latch Output 5.13 

IDT101508 64K x 4 Conditional Write, Latch Output 5.13 

I DTI 0509 32K x 9 ECL 1 0K SRAM 5.14 

I DTI 00509 32K x 9 ECL 1 00K SRAM 5.14 

IDT101509 32K x 9 ECL 1 01 K SRAM 5.14 

FIFO PRODUCTS 

IDT7200 256 x 9-Bit Parallel FIFO 6.1 

IDT7201 51 2 x 9-Bit Parallel FIFO 6.1 

IDT7202 1 024 x 9-Bit Parallel FIFO 6.2 

IDT7203 2K x 9-Bit Parallel FIFO 6.3 

IDT7204 4K x 9-Bit Parallel FIFO 6.3 

IDT7205 8K x 9-Bit Parallel FIFO 6.4 

IDT7206 1 6K x 9-Bit Parallel FIFO 6.5 
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FIFO PRODUCTS (CONTINUED) 

IDT72021 1 K x 9-Bit Parallel FIFO w/ Flags and Output Enable . 6.6 

IDT72031 2K x 9-Bit Parallel FIFO w/Flags and Output Enable 6.6 

IDT72041 4K x 9-Bit Parallel FIFO w/Flags and Output Enable 6.6 

IDT72103 2K x 9-Bit Configurable Parallel-Serial FIFO 6.7 

IDT72104 4K x 9-Bit Configurable Parallel-Serial FIFO 6.7 

IDT72105 256 x 16-Bit Parallel-to-Serial FIFO 6.8 

IDT72115 512 x 16-Bit Parallel-to-Serial FIFO 6.8 

IDT72125 1024 x 16-Bit Parallel-to-Serial FIFO. 6.8 

IDT72131 2048 x 9-Bit Parallel-to-Serial FIFO 6.9 

IDT72141 4096 x 9-Bit Parallel-to-Serial FIFO 6.9 

IDT72132 2048 x 9-Bit Serial-to-Parallel FIFO 6.10 

IDT72142 4096 x 9-Bit Serial-to-Parallel FIFO 6.10 

IDT72200 256 x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.1 1 

IDT72210 512 X 8-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.1 1 

IDT72420 64 x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.1 1 

IDT72201 256 x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.12 

IDT72211 512 X 9-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.12 

IDT72421 64 x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.12 

IDT72215A 512 x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.13 

IDT72225A 1024 x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.13 

IDT72220 IK x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.14 

IDT72230 2K x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.14 

IDT72240 4K x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.14 

IDT72221 IK x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.15 

IDT72231 2K x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.15 

IDT72241 4K x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.15 

IDT72235 2K x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.16 

IDT72245 4K x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) 6.16 

IDT72401 64 x 4 FIFO 6.17 

IDT72402 64x5 FIFO 6.17 

IDT72403 64 x 4 FIFO (w/Output Enable) 6.17 

IDT72404 64 x 5 FIFO (w/Output Enable) 6.17 

IDT72413 64 x 5 FIFO (w/Flags) 6.18 

IDT7251 512 x 18-Bit — IK x 9-Bit BiFlFO 6.19 

IDT7252 IK x 18-Bit — 2Kx 9-Bit BiFlFO : 6.19 

IDT72510 512 x 18-Bit — IK x 9-Bit BiFlFO 6.19 

IDT72520 IK x 18-Bit — 2Kx 9-Bit BiFlFO 6.19 

IDT72511 512 x 18-Bit BiFlFO 6.20 

IDT72521 IK x 18-Bit BiFlFO 6.20 

IDT72605 256 x 18-Bit Synchronous BiFlFO (SyncBiFlFO™) 6.21 

IDT72615 512 x 18-Bit Synchronous BiFlFO (SyncBiFlFO™) 6.21 

SPECIALTY MEMORY PRODUCTS 

IDT7130 8K(1Kx 8) Dual-Port RAM (MASTER) 7.1 

IDT7140 8K (1 K x 8) Dual-Port RAM (SLAVE) 7.1 

IDT7030 8K (1 K x 8) Dual-Port RAM (MASTER) 7.2 

IDT7040 8K (1 K x 8) Dual-Port RAM (SLAVE) 7.2 

IDT701 0 9K (1 K x 9) Dual-Port RAM (MASTER) 7.3 

IDT70104 9K (1 K x 9) Dual-Port RAM (SLAVE) 7.3 

IDT701 01 9K (1 K x 9) Dual-Port RAM (MASTER w/lnterrupts) 7.4 

IDT701 05 9K (1 K x 9) Dual-Port RAM (SLAVE w/lnterrupts) 7.4 

IDT7132 16K(2Kx 8) Dual-Port RAM (MASTER) 7.5 

IDT7142 16K (2K x 8) Dual-Port RAM (SLAVE) 7.5 

IDT7032 16K (2K x 8) Dual-Port RAM (MASTER) 7.6 
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SPECIALTY MEMORY PRODUCTS (CONTINUED) 

IDT7042 1 6K (2K x 8) Dual-Port RAM (SLAVE) 7.6 

IDT71321 16K(2Kx 8) Dual-Port RAM (MASTER w/lnterrupts) 7.7 

IDT71421 16K(2Kx 8) Dual-Port RAM (SLAVE w/lnterrupts) 7.7 

IDT7012 18K (2K x 9) Dual-Port RAM 7.8 

IDT70121 18K (2K x 9) Dual-Port RAM (MASTER w/lnterrupts) 7.9 

IDT70125 18K (2K x 9) Dual-Port RAM (SLAVE w/lnterrupts) 7.9 

IDT7133 32K (2K x 16) Dual-Port RAM (MASTER) 7.10 

IDT7143 32K (2K x 16) Dual-Port RAM (SLAVE) 7.10 

IDT7133SA/LA 32K (2K x 16) Dual-Port RAM (MASTER) 7.11 

IDT7143SA/LA 32K (2K x 16) Dual-Port RAM (SLAVE) 7.11 

IDT7134 32K (4K x 8) Dual-Port RAM 7.12 

IDT7134SA/LA 32K (4K x 8) Dual-Port RAM 7.13 

IDT71342 32K (4K x 8) Dual-Port RAM (w/Semaphores) 7.14 

IDT71342SA/LA 32K (4K x 8) Dual-Port RAM (w/Semaphores) 7.15 

IDT7014 32K (4K x 9) Dual-Port RAM 7.16 

IDT7024 64K (4K x 16) Dual-Port RAM 7.17 

IDT7005 64K (8K x 8) Dual-Port RAM 7.18 

I DT7025 1 28K (8K x 1 6) Dual-Port RAM 7. 1 9 

IDT7006 128K (16K x 8) Dual-Port RAM 7.20 

IDT7050 8K (1 K x 8) FourPort™ RAM 7.21 

IDT7052 16K (2K x 8) FourPort™ RAM 7.22 

IDT71502 64K(4Kx 16) Registered RAM (w/SPC™) 7.23 

SUBSYSTEMS PRODUCTS 
MULTI-PORT MODULES 

IDT7M134 8K X 8 Master Dual-Port SRAM Module 8.1 

IDT7M144 8K x 8 Slave Dual-Port SRAM Module 8.2 

IDT7M135 16Kx 8 Master Dual-Port SRAM Module 8.1 

IDT7M145 16K x 8 Slave Dual-Port SRAM Module 8.2 

IDT7M137 32K x 8 Master Dual-Port SRAM Module 8.3 

IDT7M1003 64K X 8 Dual-Port SRAM Module 8.4 

IDT7M1001 128K X 8 Dual-Port SRAM Module 8.4 

IDT7M1004 8K X 9 Dual-Port SRAM Module 8.5 

IDT7M1005 16K x 9 Dual-Port SRAM Module 8.5 

IDT7MB6056 32K x 1 6 Dual-Port (Shared Memory) SRAM Module 8.6 

IDT7MB1008 32K x 16 Dual-Port SRAM Module 8.7 

IDT7MB1006 64K x 16 Dual-Port SRAM Module 8.7 

IDT7MB6046 64K x 16 Dual-Port (Shared Memory) SRAM Module 8.6 

IDT7MB6036 128K x 16 Dual-Port (Shared Memory) SRAM Module 8.6 

IDT7MB6156 32K x 18 Dual-Port (Shared Memory) SRAM Module 8.8 

IDT7MB6146 64K x 18 Dual-Port (Shared Memory) SRAM Module 8.8 

IDT7MB6136 128K x 18 Dual-Port (Shared Memory) SRAM Module 8.8 

IDT7M1002 16K x 32 Dual-Port SRAM Module 8.9 

IDT7MB1041 8K x 8 FourPort™ SRAM Module 8.10 

IDT7MB1 042 4K x 8 FourPort™ SRAM Module 8.10 

IDT7MB1043 4K x 16 FourPort™ SRAM Module 8.11 

IDT7MB1044 2K x 16 FourPort™ SRAM Module 8.11 

FIFO MODULES 

IDT7M205 8K x 9-Bit CMOS FIFO Module 8.12 

IDT7MP2005 8K x 9-Bit FIFO Module 8.13 

IDT7M206 16Kx 9-Bit CMOS FIFO Module 8.12 

IDT7MP2011 16Kx 9 Bit FIFO Module 8.13 

IDT7M207 32K x 9-Bit CMOS FIFO Module 8.14 
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FIFO MODULES (CONTINUED) 

IDT7MP2010 16Kx 18-Bit FIFO Module 8.15 

IDT7MP2009 32K x 18-Bit FIFO Module 8.15 

SRAM MODULES 

IDT7MC4001 1M x 1 CMOS Static RAM Module 8.16 

IDT7M4042 256K x 4 CMOS Static RAM Module 8.17 

IDT7M812 64K x 8 CMOS Static RAM Module 8.18 

IDT8M824S 128K x 8 CMOS Static RAM Module 8.19 

IDT8MP824S 128K x 8 CMOS Static RAM Module 8.20 

IDT8MP824L 128K x 8 CMOS Static RAM Module 8.21 

IDT7MP4034 256K x 8 CMOS Static RAM Module 8.22 

IDT7M4048 512K x 8 CMOS Static RAM Module 8.23 

IDT7MB4048 512K x 8 CMOS Static RAM Module 8.23 

IDT7MP4008S 512K x 8 CMOS Static RAM Module 8.24 

IDT7MP4058L 512K x 8 CMOS Static RAM Module 8.25 

IDT7M912 64K x 9 CMOS Static RAM Module 8.18 

IDT7MB4040 256K x 9 CMOS Static RAM Module 8.26 

IDT7MC4005 16K x 16 CMOS Static RAM Module 8.27 

IDT7MB4009 2(1 6K x 16) CMOS Static RAM Module 8.28 

IDT8M61 2 32K x 1 6 CMOS Static RAM Module 8.29 

IDT8MP612S 32K x 16 CMOS Static RAM Module 8.30 

IDT8MP612L 32K x 16 CMOS Static RAM Module 8.31 

IDT7M624 64K x 16 CMOS Static RAM Module 8.32 

IDT8M624 64K x 1 6 CMOS Static RAM Module 8.29 

IDT8MP624S 64K x 16 CMOS Static RAM Module 8.30 

IDT8MP624L 64K x 1 6 CMOS Static RAM Module 8.31 

IDT7M4016 256K x 16 CMOS Static RAM Module 8.33 

IDT7MP4047 51 2K x 16 CMOS Static RAM Module 8.34 

IDT7MC4032 16K x 32 CMOS Static RAM Module w/Separate Data I/O 8.35 

IDT7MP4031 16Kx 32 CMOS Static RAM Module 8.36 

IDT7M4003 32K x 32 CMOS Static RAM Module 8.37 

IDT7M4017 64K x 32 CMOS Static RAM Module 8.38 

IDT7MP4036 64K x 32 CMOS Static RAM Module 8.39 

IDT7M4013 128K x 32 CMOS Static RAM Module 8.37 

IDT7MP4045 256K x 32 CMOS Static RAM Module 8.40 

CACHE MODULES 

IDT7MB6064 (2 x 4K x 64) Data/Instruction Cache Module tor IDT79R3000 CPU 8.41 

IDT7MB6044 (2 x 4K x 64) Data/lnstruction Cache Module for IDT79R3000 CPU 8.42 

IDT7MB6043 (2 x 8K x 64) Data/Inst ruction Cache Module for IDT79R3000 CPU 8.43 

IDT7MB6051 (2 x 8K x 64) Data/Instruction Cache Module for IDT79R3000 CPU 

(Multiprocessor) 8.44 

IDT7MB6039 (2 x 1 6K x 60) Data/Instruction Cache Module for IDT79R3000 CPU 8.45 

IDT7MB6049 (2 x 16K x 60) Data/Instruction Cache Module for IDT79R3000 CPU 

(Multiprocessor) 8.46 

IDT7MB6040 (2 x 1 6K x 64) Data/Instruction Cache Module for General Purpose CPUs 8.47 

IDT7MB6061 (2 x 16K x 60) Data/Instruction w/Resettable Instruction Tag 8.48 

WRITABLE CONTROL STORE MODULES 

IDT7M6032 16K x 32 Writable Control Store Static RAM Module 8.49 

IDT7MB6042 8K x 1 1 2 Writable Control Store Static RAM Module 8.50 

OTHER MODULES 

Flexi-Pak Family Modules with Various Combinations of SRAMs, EPROMs and EEPROMs 8.51 
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CUSTOM MODULES 

Subsystem Custom Module Capabilities 8.52 

APPLICATION AND TECHNICAL NOTES 

FIFO Products Application Notes 

AN-01 Understanding the IDT7201/7202 FIFO 9.1 

AN-15 Using the IDT72103/104 Serial-Parallel FIFO 9.2 

AN-22 Performance Advantages with IDT's Flagged FIFOs 9.3 

AN-34 General Purpose (1 6-Bit to 8-Bit) BiFlFO Interface 9.4 

AN-36 The BiFlFO Parity Generation and Checking 9.5 

AN-39 The Programmable Flags of BiFIFOs 9.6 

AN-56 The BiFlFO Expansion Configuration 9.7 

AN-57 The BiFlFO Bypass 9.8 

AN-60 Designing with the IDT SyncFlFO™ — The Architecture of the Future 9.9 

AN-69 Depth Expansion of IDT's Synchronous FIFOs Using the Ring Counter 

Approach 9.10 

AN-71 Simplify SCSI Host Adapter Design with Bidirectional FIFO Memories 9.11 

AN-73 Understanding the Output Control OE of the Flagged FIFOs: 

IDT72021/31/41 9.12 

FIFO Products Technical Notes 

TN-06 Designing with FIFOs 9.13 

TN-08 Operating FIFOs on Full and Empty Boundary Conditions 9.14 

TN-09 Cascading FIFOs or FIFO Modules 9.15 

Specialty Memory Products Application Notes 

AN-02 Dual-Port RAMs Simplify Communication in Computer Systems 9.16 

AN-09 Dual-Port RAMs Yield Bit-Slice Designs without Microcode 9.17 

AN-14 Dual-Port RAMs with Semaphore Arbitration .; 9.18 

AN-42 Using the IDT7052 FourPort™ SRAM 9.19 

AN-43 IDT FourPort™ RAM Facilitates Multiprocessor Designs 9.20 

AN-45 Introduction to IDT's FourPort™ RAM 9.21 

AN-59 Using IDT7024 and IDT7025 Dual-Pori Static RAMs to Match System 

Bus Widths 9.22 

AN-67 Using IDT71502 RAMs in a Real-Time Debugging Tool for an R3000 

Microprocessor-based System 9.23 

AN-68 Dual-Port RAM Simplifies PC to TMS320 Interface 9.24 

AN-70 Dual-Port Interrupt Expansion 9.25 

Subsystems Products Application Notes 

AN-44 Design Guidelines for Custom Module Packages 9.26 

AN-74 Understanding Dual-Port Shared Memory Modules 9.27 

AN-75 Using the IDT7M401 7 in an 8-Bit and 16-Bit Wide Organization 9.28 

AN-76 Using the IDT7MB6049 Cache Module with the IDT79R3000 RISC Processor 

in Single or Multiprocessor Systems 9.29 
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STATIC RAM DATA BOOK 

The following is a listing of the data sheets located in the 1991 Static RAM Data Book available under separate 
cover: 


STATIC RAM PRODUCTS PAGE 

CEMOS STATIC RAMS WITH POWER DOWN 

IDT6116 2K x 8 with Power-Down 5.1 

IDT61298 64K x 4 with Output Enable and Power-Down 5.2 

IDT6167 16K x 1 with Power-Down 5.3 

IDT6168 4K x 4 with Power-Down 5.4 

IDT6178 4K x 4 Cache-Tag with Power-Down 5.5 

IDT61970 4K x 4 with Oiutput Enable and Power-Down 5.6 

IDT6198 16K x 4 with Output Enable and Power-Down 5.7 

IDT71024 128K x 8 with Power-Down 5.8 

IDT71028 256K x 4 with Power-Down 5.9 

IDT71256 32K x 8 with Power-Down 5.10 

IDT71 258 64K x 4 with Power-Down 5.1 1 

IDT71259 32K x 9 with Power-Down 5.12 

IDT71281 64K x 4 with Separate I/O and Power-Down 5.13 

IDT71282 64K x 4 with Separate I/O and Power-Down 5.13 

IDT71569 8K x 9 with Address Latch and Power-Down 5.14 

IDT71586 4K x 16 with Address Latch and Power-Down 5.15 

IDT71589 32K x 9 Burst Mode with Power-Down 5.16 

IDT7164 8K x 8 with Power-Down 5.17 

IDT7165 8K x 8 Resettable Power-Down 5.18 

IDT71681 4K x 4 Separate I/O and Power-Down 5.19 

IDT71682 4K x 4 Separate I/O and Power-Down 5.19 

IDT7169 8K x 9 with Power-Down 5.20 

IDT7174 8K x 8 Cache-Tag with Power-Down 5.21 

IDT7187 64K x 1 with Power-Down 5.22 

IDT7188 16K x 4 with Power-Down 5.23 

IDT7198 16K x 4 with Output Enable, 2 CS and Power-Down 5.24 

IDT71981 16K x 4 with Separate I/O and Power-Down 5.25 

IDT71982 16K x 4 with Separate I/O and Power-Down 5.25 

HIGH-SPEED BiCEMOS™ STATIC RAMS 

IDT61 B298 64K x 4 BiCEMOS with Output Enable 6.1 

IDT61B98 16Kx 4 BiCEMOS with Output Enable 6.2 

IDT71B024 128Kx 8 BiCEMOS 6.3 

IDT71B028 256K x 4 BiCEMOS 6.4 

IDT71B221 4K x 18 x 2 BiCEMOS with Self-Timed Latch 6.5 

IDT71B222 4K x 18 x 2 BiCEMOS with Dual Address Latches 6.6 

IDT71B229 16K x 9 x 2 BiCEMOS Cache RAM 6.7 

IDT71 B256 32K x 8 BiCEMOS 6.8 

IDT71B258 64K x 4 BiCEMOS 6.9 

IDT71B556 32K x 8 BiCEMOS with Address Latch 6.10 

IDT71B569 8K x 9 BiCEMOS with Address Latch 6.11 

IDT71B64 8Kx 8 BiCEMOS 6.12 

IDT71 B65 8K x 8 BiCEMOS Resettable 6.13 

IDT71B69 8Kx 9 BiCEMOS 6.14 

IDT71B74 8K x 8 BiCEMOS Cache-Tag 6.15 

IDT71B79 8K x 9 BiCEMOS Cache-Tag 6.16 

IDT71B88 16Kx 4 BiCEMOS 6.17 

IDT71B98 16K x 4 BiCEMOS with Output Enable, 2 CS 6.18 


1.3 


12 



1991 STATIC RAM DATA BOOK (CONTINUED) 


PAGE 


APPLICATION AND TECHNICAL NOTES 
Static RAM Products Application Notes 

AN-05 Separate I/O RAMs Increase Speed and Reduce Part Count 7.1 

AN-07 Cache Tag RAM Chips Simplify Cache Memory Design 7.2 

AN-10 Low-Power and Battery Back-up Operation of CMOS Static RAMs 7.3 

AN-20 Static RAM Timing 7.4 

AN-27 Cache Design Considerations Using the IDT79R3000 7.5 

AN-30 Complete Cache Controller and Cache Memory Design Using 

IDT Standard Parts for the 80386 Processor 7.6 

AN-38 IDT Static RAMs Simplify Cache Design with the 80386 and 82385 7.7 

AN-46 A 33MHz MC68030 Zero-Wait Cache Memory 7.8 

AN-79 25MHz 68020 Cache in Only 19 Chips 7.9 

AN-80 Intel 386/82385 Cached Microprocessor System 7.10 

AN-81 The IDT71586 Burst Cache RAM for the i486 7.11 

Static RAM Products Technical Notes 

TN-04 Using High-Speed 8K x 8 RAMs 7.12 

TN-07 Fast RAMs Give Lowest Power 7.13 

TN-1 1 Cache Timing for the 68020 7.14 

TN-13 Cache Timing for the 80386 7.15 

TN-1 6 Programmable Length Shift Registers Using RAMs and Counters 7.16 
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PART NO. PAGE 

1 00484 4K X 4 ECL 1 00K SRAM (Corner Power) SMP 

1 00490 64K x 1 ECL 1 00K SRAM SMP 

100494 1 6K x 4 ECL 1 00K SRAM SMP 

100496LL 16Kx 4 Self-Timed Latch Input, Latch Output..... SMP 

100496RL 16K x 4 Self-Timed Reg Input, Latch Output SMP 

100497 16K x 4 Synchronous Write, Latch Output SMP 

100498 16K x 4 Conditional Write, Latch Output SMP 

100504 64K x 4 ECL 1 00K SRAM SMP 

100506LL 64K x 4 Self-Timed Latch Input, Latch Output SMP 

100506RL 64K x 4 Self-Timed Reg Input, Latch Output SMP 

1 00507 1 6K x 4 Synchronous Write, Latch Output SMP 

100508 64K x 4 Conditional Write, Latch Output SMP 

1 00509 32K x 9 ECL 1 00K SRAM SMP 

1 00A484 4K x 4 ECL 1 00K SRAM (Center Power) SMP 

101484 4Kx 4 ECL 101K SRAM (Corner Power) SMP 

101490 64Kx1 ECL 101 K SRAM SMP 

101494 16K x 4 ECL 101K SRAM SMP 

101 496LL 1 6K x 4 Self-Timed Latch Input, Latch Output SMP 

1 01 496RL 1 6K x 4 Self-Timed Reg Input, Latch Output SMP 

101497 16K x 4 Synchronous Write, Latch Output SMP 

101498 16K x 4 Conditional Write, Latch Output SMP 

101504 64K x 4 ECL 100K SRAM SMP 

101506LL 64K x 4 Self-Timed Latch Input, Latch Output SMP 

1 01 506RL 64K x 4 Self-Timed Reg Input, Latch Output SMP 

101507 16K x 4 Synchronous Write, Latch Output SMP 

1 01508 64K x 4 Conditional Write, Latch Output SMP 

101509 32K x 9 ECL 1 01 K SRAM SMP 

101A484 4K x 4 ECL 1 01K SRAM (Center Power) SMP 

1 0484 4K X 4 ECL 1 0K SRAM (Corner Power) SMP 

10490 64K x 1 ECL 10K SRAM SMP 

10494 1 6K x 4 ECL 1 0K SRAM SMP 

1 0496LL 1 6K x 4 Self-Timed Latch Input, Latch Output SMP 

1 0496RL 1 6K x 4 Self-Timed Reg Input, Latch Output SMP 

1 0497 1 6K x 4 Synchronous Write, Latch Output SMP 

10498 16K x 4 Conditional Write, Latch Output SMP 

10504 64K x 4 ECL 10K SRAM SMP 

1 0506LL 64K x 4 Self-Timed Latch Input, Latch Output SMP 

10506RL 64K x 4 Self-Timed Reg Input, Latch Output SMP 

10507 64K x 4 Synchronous Write, Latch Output SMP 

10508 64K x 4 Conditional Write, Latch Output SMP 

10509 32K x 9 ECL 10K SRAM SMP 

10A484 4K x 4 ECL 10K SRAM (Center Power) SMP 

29FCT52 Non-inverting Octal Registered Transceiver LOGIC 

29FCT520 Multi-level Pipeline Register LOGIC 

29FCT520T Multi-level Pipeline Register LOGIC 

29FCT521T Multi-level Pipeline Register LOGIC 

29FCT52T Non-inverting Octal Registered Transceiver LOGIC 

29FCT53 Inverting Octal Registered Transceiver LOGIC 

29FCT53T Inverting Octal Registered Transceiver LOGIC 

39C01 4-Bit Microprocessor Slice LOGIC 

39C10 12-Bit Sequencer LOGIC 

39C60 1 6-Bit Cascadable EDC LOGIC 

39C8XX IDT39C8XXX Family LOGIC 

49C25 Microcycle Length Controller LOGIC 
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49C402 1 6-Bit Microprocessor Slice LOGIC 

49C410 1 6-Bit Sequencer LOGIC 

49C460 32-Bit Cascadable EDC LOGIC 

49C465 32-Bit CMOS Flow-ThruEDC Unit LOGIC 

49C466 64-BIT CMOS Flow-ThruEDC Unit LOGIC 

49FCT661 16-Bit Synchronous Binary Counter LOGIC 

49FCT804 High-Speed Tri-Port Bus Multiplexer LOGIC 

49FCT805 Buffer/Clock Driver w/Guaranteed Skew LOGIC 

49FCT806 Buffer/Clock Driver w/Guaranteed Skew LOGIC 

49FCT818 Octal Register with S PC™ LOGIC 

54/74FBT2240 Inverting Octal Buffer/Line Driver w/25£l Series Resistor LOGIC 

54/74FBT2244 Inverting Octal Buffer/Line Driver w/25Q Series Resistor LOGIC 

54/74FBT2373 Octal Transparent Latch w/3-State & 250 Series Resistor LOGIC 

54/74FBT240 Inverting Octal Buffer/Line Driver LOGIC 

54/74FBT241 Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FBT244 Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FBT245 Non-inverting Octal Transceiver LOGIC 

54/74FBT2827 Non-inverting 1 0-Bit Buffers/Driver w/25Q Series Resistor LOGIC 

54/74FBT2828 Invertingl 0-Bit Buffers/Driver w/25ft Series Resistor LOGIC 

54/74FBT2841 10-Bit Memory Latch w/250 Series Resistor LOGIC 

54/74FBT373 Octal Transparent Latch w/3-State LOGIC 

54/74FBT374 Non-inverting Octal D Register LOGIC 

54/74FBT540 Inverting Octal Buffer LOGIC 

54/74FBT541 Non-inverting Octal Buffer LOGIC 

54/74FBT821 1 0-Bit Non-inverting Register LOGIC 

54/74FBT823 9-Bit Inverting Register LOGIC 

54/74FBT827 Non-inverting 10-Bit Buffers/Driver LOGIC 

54/74FBT828 Invertingl 0-Bit Buffers/Driver LOGIC 

54/74FBT841 10-Bit Non-inverting Latch LOGIC 

54/74FCT138 1-of-8 Decoder LOGIC 

54/74FCT138T 1-of-8 Decoder LOGIC 

54/74FCT139 Dual 1-of-4 Decoder LOGIC 

54/74FCT139T Dual 1-of-4 Decoder LOGIC 

54/74FCT151T 8-Input Multiplexer LOGIC 

54/74FCT157T Quad 2-Input Multiplexer LOGIC 

54/74FCT161 Synchronous Binary Counter w/Asynchronous Master Reset LOGIC 

54/74FCT161T Synchronous Binary Counter w/Asynchronous Master Reset LOGIC 

54/74FCT163 Synchronous Binary Counter w/Synchronous Reset LOGIC 

54/74FCT163T Synchronous Binary Counter w/Synchronous Reset LOGIC 

54/74FCT182 Carry Lookahead Generator LOGIC 

54/74FCT191 Up/Down Binary Counter w/Preset and Ripple Clocks LOGIC 

54/74FCT191T Up/Down Binary Counter w/Preset and Ripple Clock LOGIC 

54/74FCT193 Up/Down Binary Counter w/Separate Up/Down Clocks LOGIC 

54/74FCT193T Up/Down Binary Counter w/Separate Up/Down Clocks LOGIC 

54/74FCT240 Inverting Octal Buffer/Line Driver LOGIC 

54/74FCT240T Inverting Octal Buffer/Line Driver LOGIC 

54/74FCT241 Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FCT241T Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FCT244 Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FCT244T Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FCT245 Non-inverting Octal Transceiver LOGIC 

54/74FCT245T Non-inverting Octal Transceiver LOGIC 

54/74FCT251T 8-Input Multiplexer w/3-State LOGIC 

54/74FCT257T Quad 2-Input Multiplexer w/3-State LOGIC 
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54/74FCT273 Octal D Flip-Flop w/Common Master Reset LOGIC 

54/74FCT273T Octal D Flip-Flop w/Common Master Reset LOGIC 

54/74FCT299 8-Input Universal Shift Register w/Common Parallel I/O Pins LOGIC 

54/74FCT299T 8 Input Universal Shift Register w/Common Parallel I/O Pins LOGIC 

54/74FCT373 Non-inverting Octal Transparent Latch LOGIC 

54/74FCT373T Non-inverting Octal Transparent Latch w/3-State LOGIC 

54/74FCT374 Non-inverting Octal D Flip-Flop LOGIC 

54/74FCT374T Non-inverting Octal D Register LOGIC 

54/74FCT377 Octal D Flip-Flop w/Clock Enable LOGIC 

54/74FCT377T Octal D Flip-Flop w/Clock Enable LOGIC 

54/74FCT399 Quad Dual-Port Register LOGIC 

54/74FCT399T Quad Dual-Port Register LOGIC 

54/74FCT521 8-Bit Identity Comparator LOGIC 

54/74FCT521T 8-Bit Identity Comparator LOGIC 

54/74FCT533 Inverting Octal Transparent Latch LOGIC 

54/74FCT533T Inverting Octal Transparent Latch w/3-State LOGIC 

54/74FCT534 Inverting Octal D Flip-Flop w/3-State LOGIC 

54/74FCT534T Inverting Octal D Register LOGIC 

54/74FCT540 Inverting Octal Buffer/Line Driver LOGIC 

54/74FCT540T Inverting Octal Buffer/Line Driver LOGIC 

54/74FCT541 Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FCT541T Non-inverting Octal Buffer/Line Driver LOGIC 

54/74FCT543 Non-inverting Octal Latched Transceiver LOGIC 

54/74FCT543T Non-inverting Octal Latched Transceiver LOGIC 

54/74FCT573 Non-inverting Octal Transparent Latch LOGIC 

54/74FCT573T Non-inverting Octal Transparent Latch w/3-State LOGIC 

54/74FCT574 Non-inverting Octal D Register w/3-State LOGIC 

54/74FCT574T Non-inverting Octal D Register LOGIC 

54/74FCT620T Inverting Octal Bus Transceiver w/3-State LOGIC 

54/74FCT621T Non-inverting Octal Bus Transceiver (Open Drain) LOGIC 

54/74FCT622T Inverting Octal Bus Transceiver (Open Drain) LOGIC 

54/74FCT623T Non-inverting Octal Bus Transceiver w/3-State LOGIC 

54/74FCT640 Inverting Octal Transceiver LOGIC 

54/74FCT640T Inverting Octal Transceiver LOGIC 

54/74FCT645 . Non-inverting Octal Transceiver LOGIC 

54/74FCT645T Non-inverting Octal Transceiver LOGIC 

54/74FCT646 Non-inverting Octal Registered Transceiver LOGIC 

54/74FCT646T Non-inverting Octal Registered Transceiver LOGIC 

54/74FCT648T Inverting Octal Registered Transceiver LOGIC 

54/74FCT651T Inverting Octal Registered Transceiver LOGIC 

54/74FCT652T Non-inverting Octal Registered Transceiver LOGIC 

54/74FCT821 10-Bit Non-inverting Register w/3-State LOGIC 

54/74FCT821T 10-Bit Non-inverting Register w/3-State LOGIC 

54/74FCT823 9-Bit Non-inverting Register w/Clear & 3-State LOGIC 

54/74FCT823T 9-Bit Non-inverting Register w/Clear & 3-State LOGIC 

54/74FCT824 9-Bit Inverting Register w/Clear & 3-State LOGIC 

54/74FCT825 8-Bit Non-inverting Register LOGIC 

54/74FCT825T 8-Bit Non-inverting Register w/Clear & 3-State LOGIC 

54/74FCT827 10-Bit Non-inverting Buffer LOGIC 

54/74FCT827T 10-Bit Non-inverting Buffer LOGIC 

54/74FCT828T 10-Bit Inverting Buffer LOGIC 

54/74FCT833 8-Bit Transceiver w/Parity LOGIC 

54/74FCT841 10-Bit Non-inverting Latch LOGIC 

54/74FCT841T 10-Bit Non-inverting Latch LOGIC 
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54/74FCT843 9-Bit Non-inverting Latch LOGIC 

54/74FCT843T 9-Bit Non-inverting Latch LOGIC 

54/74FCT844 9-Bit Inverting Latch LOGIC 

54/74FCT845 8-Bit Non-inverting Latch LOGIC 

54/74FCT845T 8-Bit Non-inverting Latch LOGIC 

54/74FCT861 10-Bit Non-inverting Transceiver LOGIC 

54/74FCT863 9-Bit Non-inverting Transceiver LOGIC 

54/74FCT864 9-Bit Inverting Transceiver LOGIC 

61 1 6 2K x 8 with Power-Down SRAM 

61298 64K x 4 with Output Enable and Power-Down SRAM 

6167 16K x 1 with Power-Down SRAM 

6168 4K x 4 with Power-Down SRAM 

6178 4K x 4 Cache-Tag with Power-Down SRAM 

61 970 4K x 4 with Oiutput Enable and Power-Down SRAM 

6198 16K x 4 with Output Enable and Power-Down SRAM 

61 B298 64K x 4 BiCEMOS with Output Enable SRAM 

61 B98 1 6K x 4 BiCEMOS with Output Enable SRAM 

7005 64K (8K x 8) Dual-Port RAM SMP 

7006 128K (16K x 8) Dual-Port RAM SMP 

701 0 9K (1 K x 9) Dual-Port RAM (MASTER) SMP 

70101 9K (1 K x 9) Dual-Port RAM (MASTER w/lnterrupts) SMP 

701 04 9K (1 K x 9) Dual-Port RAM (SLAVE) SMP 

701 05 9K (1 K x 9) Dual-Port RAM (SLAVE w/lnterrupts) SMP 

7012 18K (2K x 9) Dual-Port RAM SMP 

70121 18K (2K x 9) Dual-Port RAM (MASTER w/lnterrupts) SMP 

70125 18K (2K x 9) Dual-Port RAM (SLAVE w/lnterrupts) SMP 

7014 32K (4K x 9) Dual-Port RAM SMP 

7024 64K (4K x 16) Dual-Port RAM . .. SMP 

7025 128K (8K x 16) Dual-Port RAM SMP 

7030 8K (IK x 8) Dual-Port RAM (MASTER) SMP 

7032 16K(2Kx 8) Dual-Port RAM (MASTER) SMP 

7040 8K (1 K X 8) Dual-Port RAM (SLAVE) SMP 

7042 16K (2K x 8) Dual-Port RAM (SLAVE) SMP 

7050 8K (IK x 8) FourPort™ RAM SMP 

7052 16K (2K x 8) FourPort™ RAM . SMP 

71024 128K x 8 with Power-Down . SRAM 

71028 256K x 4 with Power-Down SRAM 

71256 32K x 8 with Power-Down SRAM 

71258 64K x 4 with Power-Down SRAM 

71259 32K x 9 with Power-Down SRAM 

71281 64K x 4 with Separate I/O and Power-Down SRAM 

71282 64K x 4 with Separate I/O and Power-Down SRAM 

7130 8K (IK x 8) Dual-Port RAM (MASTER) SMP 

71 32 1 6K (2K x 8) Dual-Port RAM (MASTER) SMP 

71321 16K (2K x 8) Dual-Port RAM (MASTER w/lnterrupts) SMP 

7133 32K (2K x 16) Dual-Port RAM (MASTER) SMP 

7133SA/LA 32K (2K x 16) Dual-Port RAM (MASTER) SMP 

7134 32K (4K x 8) Dual-Port RAM SMP 

71342 32K (4K x 8) Dual-Port RAM (w/Semaphores) SMP 

71342SA/LA 32K (4K x 8) Dual-Pott RAM (w/Semaphores) SMP 

7134SA/LA 32K (4K x 8) Dual-Port RAM SMP 

7140 8K (IK x 8) Dual-Port RAM (SLAVE) SMP 

7142 16K (2K x 8) Dual-Port RAM (SLAVE) SMP 

71421 16K (2K x 8) Dual-Port RAM (SLAVE w/lnterrupts) SMP 
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7143 32K (2K x 16) Dual-Port RAM (SLAVE) SMP 

7143SA/LA 32K (2K x 16) Dual-Port RAM (SLAVE) SMP 

71502 64K (4K x 16) Registered RAM (w/SPC™) SMP 

71569 8K x 9 with Address Latch and Power-Down SRAM 

71586 4K x 16 with Address Latch and Power-Down SRAM 

71589 32K x 9 Burst Mode with Power-Down SRAM 

7164 8K x 8 with Power-Down SRAM 

7165 8K x 8 Resettable Power-Down SRAM 

71681 4K x 4 Separate I/O and Power-Down SRAM 

71682 4K x 4 Separate I/O and Power-Down SRAM 

7169 8K x 9 with Power-Down SRAM 

7174 8K x 8 Cache-Tag with Power-Down SRAM 

7187 64K x 1 with Power-Down SRAM 

7188 16K x 4 with Power-Down SRAM 

7198 16K x 4 with Output Enable, 2 CS and Power-Down SRAM 

71981 16K x 4 with Separate I/O and Power-Down SRAM 

71982 1 6K x 4 with Separate I/O and Power-Down SRAM 

71 B024 1 28K x 8 BiCEMOS SRAM 

71 B028 256K x 4 BiCEMOS SRAM 

71B221 4K x 18 x 2 BiCEMOS with Self-Timed Latch and Power-Down SRAM 

71B222 4K x 18 x 2 BiCEMOS with Dual Address Latches and Power-Down SRAM 

71 B229 1 6K x 9 x 2 BiCEMOS Cache RAM SRAM 

71 B256 32K x 8 BiCEMOS SRAM 

71 B258 64K x 4 BiCEMOS SRAM 

71 B556 32K x 8 BiCEMOS with Address Latch SRAM 

71 B569 8K x 9 BiCEMOS with Address Latch SRAM 

71B64 8Kx 8 BiCEMOS SRAM 

71 B65 8K X 8 BiCEMOS Resettable SRAM 

71 B69 8K x 9 BiCEMOS SRAM 

71 B74 8K X 8 BiCEMOS Cache-Tag SRAM 

71 B79 8K X 9 BiCEMOS Cache-Tag SRAM 

71 B88 1 6K x 4 BiCEMOS SRAM 

71 B98 1 6K x 4 BiCEMOS with Output Enable, 2 CS SRAM 

7200 256 x 9-Bit Parallel FIFO SMP 

7201 51 2 x 9-Bit Parallel FIFO SMP 

7202 1 024 X 9-Bit Parallel FIFO SMP 

72021 1 K x 9-Bit Parallel FIFO w/ Flags and Output Enable SMP 

7203 2K x 9-Bit Parallel FIFO SMP 

72031 2K x 9-Bit Parallel FIFO w/Flags and Output Enable SMP 

7204 4K X 9-Bit Parallel FIFO SMP 

72041 4K x 9-Bit Parallel FIFO w/Flags and Output Enable SMP 

7205 8K x 9-Bit Parallel FIFO SMP 

7206 16Kx 9-Bit Parallel FIFO ! SMP 

721 03 2K x 9-Bit Configurable Parallel-Serial FIFO SMP 

721 04 4K x 9-Bit Configurable Parallel-Serial FIFO SMP 

721 05 256 x 1 6-Bit Parallel-to-Serial FIFO SMP 

721 0L 16x16 Parallel Multiplier-Accumulator LOGIC 

72115 512 x 16-Bit Parallel-to-Serial FIFO SMP 

72125 1024 x 16-Bit Parallel-to-Seria! FIFO SMP 

721 31 2048 x 9-Bit Parallel-to-Serial FIFO SMP 

721 32 2048 x 9-Bit Serial-to-Parallel FIFO SMP 

721 41 4096 x 9-Bit Paraliel-to-Serial FIFO SMP 

721 42 4096 x 9-Bit Serial-to-Paraliel FIFO SMP 

721 6L 16x16 Parallel Multiplier LOGIC 
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721 7L 16 x 16 Parallel Multiplier (32 Bit Output) LOGIC 

72200 256 x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72201 256 x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72210 512 X 8-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

7221 1 51 2 x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72215A 512 x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72220 1 K X 8-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72221 1 K x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72225A 1024 x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72230 2K x 8-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72231 2K x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72235 2K x 18-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72240 4K X 8-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72241 4K x 9-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72245 4Kx 18-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72401 64x4 FIFO SMP 

72402 64 x 5 FIFO SMP 

72403 64 x 4 FIFO (w/Output Enable) SMP 

72404 64 x 5 FIFO (vv/Output Enable) SMP 

72413 64 x 5 FIFO (w/Flags) SMP 

72420 64 X 8-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

72421 64 X 9-Bit Parallel SyncFlFO™ (Clocked FIFO) SMP 

7251 512 x 18-Bit — IK x 9-Bit BiFlFO SMP 

72510 512 x 18-Bit — IK x 9-Bit BiFlFO SMP 

72511 512 x 18-Bit BiFlFO SMP 

7252 IKx 18-Bit — 2Kx 9-Bit BiFlFO SMP 

72520 IKx 18-Bit — 2K x 9-Bit BiFlFO SMP 

72521 IKx 18-Bit BiFlFO SMP 

72605 256 x 18-Bit Synchronous BiFlFO (SyncBiFlFO™) SMP 

72615 512 x 18-Bit Synchronous BiFlFO (SyncBiFlFO™) SMP 

73200L 16-Bit CMOS Multilevel Pipeline Register LOGIC 

73200L 16-Bit CMOS Multilevel Pipeline Registers 6.50 

73201 L 16-Bit CMOS Multilevel Pipeline Register LOGIC 

73201 L 16-Bit CMOS Multilevel Pipeline Registers 6.50 

73210 Fast Octal Register Transceiver w/Parity LOGIC 

73210 Fast CMOS Octal Register Transceiver with Parity 6.60 

7321 1 Fast Octal Register T ransceiver w/Parity LOGIC 

7321 1 Fast CMOS Octal Register Transceiver with Parity 6.60 

7381 L 1 6-Bit CMOS Cascadable ALU LOGIC 

7383L 16-Bit CMOS Cascadable ALU LOGIC 

75C457 CMOS Single 8-Bit PaletteDAC™ tor True Color Applications LOGIC 

75C458 Triple 8-Bit PaletteDAC™ LOGIC 

75C48 8-Bit Flash ADC LOGIC 

75C58 8-Bit Flash ADC with Overflow Output LOGIC 

79R3000A RISC CPU Processor 5.10 

79R3001 RISController™ 5.20 

79R3010A RISC Floating Point Accelerator (FPA) 5.30 

79R3020 RISC CPU Write Buffer 6.40 

79R3051 IDT79R3051 Family of Integrated RISControllers™ 5.50 

79R3500 RISC CPU Processor RISCore™ 5.40 

79R3720 Bus Exchanger for R3051 Family 6.10 

79R3721 DRAM Controller for R3051 Family 6.20 

79R3722 I/O Interface Controller for R3051 Family 6.30 

79R4000 Third Generation MIPS RISC Processor 5.60 
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7M1001 128Kx 8 Dual-Port SRAM Module SMP 

7M1002 16Kx 32 Dual-Port SRAM Module . SMP 

7M1003 64K x 8 Dual-Port SRAM Module SMP 

7M1004 8K x 9 Dual-Port SRAM Module SMP 

7M1005 16Kx 9 Dual-Port SRAM Module SMP 

7M134 8K x 8 Master Dual-Port SRAM Module SMP 

7M135 16Kx 8 Master Dual-Port SRAM Module SMP 

7M137 32K x 8 Master Dual-Port SRAM Module SMP 

7M144 8K x 8 Slave Dual-Port SRAM Module SMP 

7M145 16Kx 8 Slave Dual-Port SRAM Module SMP 

7M205 8K x 9-Bit CMOS FIFO Module SMP 

7M206 1 6K x 9-Bit CMOS FIFO Module SMP 

7M207 32Kx 9-Bit CMOS FIFO Module SMP 

7M4003 32K x 32 CMOS Static RAM Module SMP 

7M4013 128K x 32 CMOS Static RAM Module SMP 

7M401 6 256K x 1 6 CMOS Static RAM Module SMP 

7M4017 64 K x 32 CMOS Static RAM Module SMP 

7M4042 256K x 4 CMOS Static RAM Module SMP 

7M4048 51 2K x 8 CMOS Static RAM Module SMP 

7M6032 16K x 32 Writable Control Store Static RAM Module SMP 

7M624 64Kx 16 CMOS static RAM Module SMP 

7M812 64K x 8 CMOS Static RAM Module SMP 

7M912 64K x 9 CMOS Static RAM Module SMP 

7MB1006 64K X 16 Dual-Port SRAM Module SMP 

7MB1 008 32K X 1 6 Dual-Port SRAM Module SMP 

7MB1041 8K X 8 FourPort™ SRAM Module SMP 

7MB 1042 4K X 8 FourPort™ SRAM Module SMP 

7MB 1043 4Kx 16 FourPort™ SRAM Module SMP 

7MB1044 2K X 16 FourPort™ SRAM Module SMP 

7MB4009 2(1 6K X 1 6) CMOS Static RAM Module SMP 

7MB4040 256K x 9 CMOS Static RAM Module SMP 

7MB4048 51 2K X 8 CMOS Static RAM Module SMP 

7MB6036 128K x 16 Dual-Port (Shared Memory) SRAM Module SMP 

7MB6039 (2 X 1 6K X 60) Data/Instruction Cache Module for IDT79R3000 CPU SMP 

7MB6040 (2 x 16K x 64) Data/Instruction Cache Module for General Purpose CPUs .. SMP 

7MB6042 8K x 1 12 Writable Control Store Static RAM Module SMP 

7MB6043 (2 X 8K X 64) Data/Instruction Cache Module for IDT79R3000 CPU SMP 

7MB6044 (2 x 4K x 64) Data/Instruction Cache Module for IDT79R3000 CPU SMP 

7MB6046 64K x 16 Dual-Port (Shared Memory) SRAM Module SMP 

7MB6049 (2 x 16K x 60) Data/Instruction Cache Module for IDT79R3000 CPU 

(Multiprocessor) SMP 

7MB6051 (2 x 8K x 64) Data/Instruction Cache Module for IDT79R3000 CPU 

(Multiprocessor) SMP 

7MB6056 32K x 1 6 Dual-Port (Shared Memory) SRAM Module SMP 

7MB6061 (2 x 16K x 60) Data/Instruction w/Resettable Instruction Tag SMP 

7MB6064 (2 x 4K x 64) Data/Instruction Cache Module for IDT79R3000 CPU SMP 

7MB6136 128Kx 18 Dual-Port (Shared Memory) SRAM Module SMP 

7MB6146 64K x 18 Dual-Port (Shared Memory) SRAM Module SMP 

7MB6156 32K x 18 Dual-Port (Shared Memory) SRAM Module SMP 

7MC4001 1 M x 1 CMOS Static RAM Module SMP 

7MC4005 16Kx 16 CMOS Static RAM Module SMP 

7MC4032 16K x 32 CMOS Static RAM Module w/Separate Data I/O SMP 

7MP2005 8K x 9-Bit FIFO Module SMP 

7MP2009 32K x 18-Bit FIFO Module SMP 
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7MP2010 16Kx 18-Bit FIFO Module SMP 

7MP2011 16K x 9 Bit FIFO Module SMP 

7MP4008S 51 2K X 8 CMOS Static RAM Module SMP 

7MP4031 1 6K x 32 CMOS Static RAM Module SMP 

7MP4034 256K x 8 CMOS Static RAM Module SMP 

7MP4036 64K x 32 CMOS Static RAM Module SMP 

7MP4045 256K x 32 CMOS Static RAM Module SMP 

7MP4047 51 2Kx 16 CMOS static RAM Module SMP 

7MP4058L 512K x 8 CMOS Static RAM Module SMP 

7RS101 R3000 CPU Modules for General Applications 7.10 

7RS102 R3000 CPU Modules for Compact Systems 7.20 

7RS103 R3000 CPU Modules for Compact Systems 7.30 

7RS104 R3001 RISC Engine for Embedded Controllers 7.40 

7RS107 R3000 CPU Modules for High Performance and Multiprocessor Systems ... 7.50 

7RS1 08 R3000 CPU Modules with 256K Caches 7.60 

7RS1 09 R3000 CPU Modules with 256K Caches 7.70 

7RS1 1 0 Plug Compatible Family of R3000 CPU Modules 7.80 

7RS300 Series Prototyping Platform for Any IDT RISC CPU Module 8.20 

7RS363 R3000 PGA Adaptor 8.30 

7RS364 R3000 Disassembler for Use with the HP 1 6500 Logic Analyzer 8.40 

7RS382 R3000 and R3001 Evaluation Boards 8.50 

7RS383 R3000 and R3001 Evaluation Boards 8.50 

7RS388 REALS™ R3000 Laser Printer Controller Evaluation System 8.60 

7RS502 MacStation 2 R3000 Development System 8.70 

7RS503 MacStation 3 R3000 Development System 8.80 

7RS901 IDT/sim System Integration Manager ROMable Debugging Kernal 8.90 

7RS903 IDT/c Multi-Host C-Compiler System 8.10 

7RS904 Cross Assembler for IBM PCs and Clones 8.1 1 

7RS905 IDT/fp Floating Point Library for Use with R3000 Compilers 8.12 

8M612 32Kx 16 CMOS Static RAM Module SMP 

8M624 64K x 1 6 CMOS Static RAM Module SMP 

8M824S 1 28K X 8 CMOS Static RAM Module SMP 

8MP612L 32Kx 16 CMOS Static RAM Module SMP 

8MP612S 32Kx 16 CMOS Static RAM Module SMP 

8MP624L 64Kx 16 CMOS Static RAM Module SMP 

8MP624S 64K x 16 CMOS Static RAM Module SMP 

8MP824L 128Kx 8 CMOS Static RAM Module SMP 

8MP824S 128K x 8 CMOS Static RAM Module SMP 

Flexi-Pak Family Modules with Various Combinations of SRAMs, EPROMs and EEPROMs .. SMP 

RC32xx IDT RISC Development Host Systems 8.10 

Subsystem Custom Module Capabilities SMP 

Third Party Development Support 8.13 
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IDT PACKAGE MARKING DESCRIPTION 


PART NUMBER DESCRIPTION 

IDT’s part number identifies the basic product, speed, 
power, package(s) available, operating temperature and 
processing grade. Each data sheet has a detailed description , 
using the part number, for ordering the proper product for the 
user's application. The part number is comprised of a series 
of alpha-numeric characters: 

1. An “IDT” corporate identifier for Integrated Device 
Technology, Inc. 

2. A basic device part number composed of alpha-numeric 
characters. 

3. A device power identifier, composed of one or two alpha 
characters, is used to identify the power options. In most 
cases, the following alpha characters are used: 

“S” or “SA” is used for the standard product’s power. 

“L" or “LA” is used for lower power than the standard 
product. 


4. A device speed identifier, when applicable, is either alpha 
characters, such as “A” or “B”, or numbers, such as 20 or 
45. The speed units, depending on the product, are in 
nanoseconds or megahertz. 

5. A package identifier, composed of one or two characters. 
The data sheet should be consulted to determine the 
packages available and the package identifiers for that 
particular product. 

6. Atemperature/process identifier. The product is available 
in either the commercial or military temperature range, 
processed to a commercial specification, orthe product is 
available in the military temperature range with full 
compliance to MIL-STD-883. Many of IDT’s products 
have burn-in included as part of the standard commercial 
process flow. 

7. A special process identifier, composed of alpha characters, 
is used for products which require radiation enhancement 
(RE) or radiation tolerance (RT). 


Example for Monolithic Devices: 

IDT XXX.. .XXX XX X..X X...X X XX 

Special Process 
Process/T emperature* 

Package* 

Speed 
Power 

Device Type* 

* Field Identifier Applicable To All Products 

2507 drwOI 

ASSEMBLY LOCATION DESIGNATOR MIL-STD-883C COMPLIANT DESIGNATOR 

IDT uses various locations for assembly. These are IDTshipsmilitaryproductswhicharecomplianttothelatest 
identified by an alpha character in the last letter of the date revision of MIL-STD-883C. Such products are identified by a 
code marked on the package. Presently, the assembly “C’’designationonthepackage. The location of this designator 
location alpha character is as follows: is specified by internal documentation at IDT. 

A = Anam, Korea 
I = USA 

P = Penang, Malaysia 
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IDT... LEADING THE CMOS FUTURE 


A major revolution is taking place in the semiconductor 
industry today. A new technology is rapidly displacing older 
NMOS and bipolar technologies as the workhorse of the 80’s 
and beyond. Thattechnology is high-speed CMOS. Integrated 
Device Technology, a company totally predicated on and 
dedicated to implementing high-performance CMOS products, 
is on the leading edge of this dramatic change. 

Beginning with the introduction of the industry’s fastest 
CMOS 2K x 8 static RAM, IDT has grown into a company with 
multiple divisions producing a wide range of high-speed 
CMOS circuits that are, in almost every case, the fastest 
available. These advanced products are produced with IDT’s 
proprietary CEMOS™ technology, a twin-well, dry-etched, 
stepper-aligned process utilizing progressively smaller 
dimensions. 

From inception, IDT’s product strategy has been to apply 
the advantages of it’s extremely fast CEMOS technology to 
produce the integrated circuit elements required to implement 
high-performance digital systems. IDT’s goal is to provide the 
circuits necessary to create systems which are far superior to 
previous generations in performance, reliability, cost weight 
and size. Many of the company’s innovative product designs 
offer higher levels of integration, advanced architectures, 
higher density packaging and system enhancement features 
that are establishing tomorrow’s industry standards. The 
company is committed to providing its customers with an ever- 
expanding seriesof these high-speed, lower-power 1C solutions 
to system design needs. 

IDT’s commitment, however, extends beyond state-of-the- 
art technology and advanced products to providing the highest 


level of customer service and satisfaction in the industry. 
Producing products to exacting quality standards that provide 
excellent, long-term reliability is given the same level of 
importance and priority as device performance. IDT is also 
dedicated to delivering these high-quality advanced products 
on time. The company would like to be known not only for its 
technological capabilities, but also for providing its customers 
with quick, responsive and courteous service. 

I DT’s product families are available in both commercial and 
military grades. As a bonus, commercial customers obtain the 
benefits of military processing disciplines, established to meet 
or exceed the stringent criteria of the applicable military 
specifications. 

IDT is the leading U.S. supplier of high-speed CMOS 
circuits. The company’s high-performance fast SRAM , FCT 
logic family, high-density modules, FIFOs, complex logic 
products, specialty memories, ECL I/O BiCEMOS™ memories, 
RISC subsystems, and the 32-bit RISC microprocessorfamily 
complement each otherto provide high-speed CMOS solutions 
to a wide range of applications and systems. 

Dedicated to maintaining its leadership position as a state- 
of-the-art 1C manufacturer, IDT will continue to focus on 
maintaining its technology edge as well as developing a 
broader range of innovative products. New products and 
speed enhancements are continuously being added to each 
of the existing product families and additional product lines will 
be introduced. Contact your IDT field representative orfactory 
marketing engineer to determine the latest product offerings, 
if you’re building state-of-the-art equipment, IDT wants to help 
you solve some of your design problems. 



IDT MILITARY AND DESC-SMD PROGRAM 


IDT is a leading supplier of military, high-speed CMOS 
circuits. The company’s high-performance Static RAMs, FCT 
Logic Family, Complex Logic (CLP), FIFOs, Specialty 
Memories (SMP), ECL I/O BiCMOS Memories, 32-bit RISC 
Microprocessor, RISC Subsystems and high-density 
Subsystems Modules product lines complement each other to 
provide high-speed CMOS solutions to a wide range of 
military applications and systems. Most of these product lines 
offer Class B products which are fully compliant to the latest 
revision of MIL-STD-883, Paragraph 1.2.1. In addition, IDT 
offers RadiationTolerant (RT), aswell as Radiation Enhanced 
(RE), products. 

IDT has an active program with the Defense Electronic 
Supply Center (DESC) to list all of IDT's military compliant 


devices on Standard Military Drawings (SMD). The SMD 
program allows standardization of militarized products and 
reduction of the proliferation of non-standard source control 
drawings. This program will go far toward reducing the need 
for each defense contractor to make separate specification 
control drawings for purchased parts. IDT plans to have 
SMDs for many of its product offerings. Presently, IDT has 88 
devices which are listed or pending listing. The devices are 
from IDT’s SRAM, FCT Logic family, Complex Logic (CLP), 
FIFOs and Specialty Memories (SMP) product families. IDT 
expects to add another 20 devices to the SMD program in the 
near future. Users should contact either IDT or DESC for 
current status of products in the SMD program. 
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RADIATION HARDENED TECHNOLOGY 


IDT manufactures and supplies radiation hardened products 
for military/aerospace applications. Utilizing special processing 
and starting materials, IDT’s radiation hardened devices are 
ableto survive in hostile radiation environments. Intotaldose, 
dose rate and environments where single event upset is of 
concern, IDT products are designed to continue functioning 
without loss of performance . I DT can supply all its products on 
these processes. Total Dose radiation testing is performed in- 


house on an ARACOR X-Ray system. External facilities are 
utilized for device research on gamma cell, LINAC and other 
radiation equipment. IDT has an on-going research and 
development program for improving radiation handling 
capabilities (See “IDT Radiation Tolerant/Enhanced Products 
for Radiation Environments” in Section 3) of IDT products/ 
processes. 
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IDT LEADING EDGE CEMOS TECHNOLOGY 


HIGH-PERFORMANCE CEMOS 

From IDT’s beginnings in 1980, it has had a belief in and a 
commitment to CMOS. The company developed a high- 
performance version of CMOS, called enhanced CMOS 
(CEMOS), that allows the design and manufacture of leading- 
edge components. It incorporates the best characteristics of 
traditional CMOS, including low power, high noise immunity 


and wide operating temperatu re range ; it also achieves speed 
and output drive equal or superior to bipolar Schottky TTL. 
The last decade has seen development and production of four 
"generations” of IDT’s CEMOS technology with process 
improvements which have reduced IDT’s electrical effective 
(Left) gate lengths by more than 50 percent from 1 .3 microns 
(millionths of a meter) in 1981 to 0.6 microns in 1989. 



CEMOS 1 

CEMOS II 

A C 

CEMOS III 

CEMOS V 

CEMOS VI 

Calendar Year 

1981 

1983 

1985 

1987 

1989 

1990 

Drawn 

Feature Size 

2.5)i 

1.7)i 

1.3)i 

1.2)i 

1.0)1 

0.8u 

Leff 

1.3)1 

1.1)i 

0.9p. 

0.8)i 

0.6)1 

0.45)i 

Basic 

Process 

Enhancements 

Dual-well, 
Wet Etch, 
Projection 
Aligned 

Dry Etch, 
Stepper 

Shrink, 

Spacer 

Silicide, 
BPSG, 
BiCEMOS 1 

BiCEMOS II 

BiCEMOS III 


2514 drw 01 

CEMOS IV = CEMOS III - scaled process optimized for high-speed logic. 


Figure 1. 


Continual advancement of CEMOS technology allows IDT 
to implement progressively higher levels of integration and 
achieve increasingly fasterspeeds maintaining the company’s 
established position as the leader in high-speed CMOS 
integrated circuits. In addition, the fundamental process 
technology has been extended to add bipolar elements to the 
CEMOS platform. IDT's BiCEMOS process combines the 
ultra-high speeds of bipolar devices with the lower power and 
cost of CMOS, allowing us to build even faster components 
than straight CMOS at a slightly higher cost. 



Figure 2. Fifteen-Hundred-Power Magnification Scanning Electron 
Microscope (SEM) Photos of the Four Generations of IDFs CEMOS 
Technology 
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Figure 3. IDT CEMOS Device Cross Section Figure 4. IDT CEMOS Built-In High Alpha Particle Immunity 


ALPHA PARTICLES 

Random alpha particles can cause memory cells to 
temporarily lose theircontentsorsuffera“soft error.” Traveling 
with high energy levels, alpha particles penetrate deep into an 
integrated chip. As they burrow into the silicon, they leave a 
trail of free electron-hole pairs in their wake. 

The cause of alpha particles is well documented and 
understood in the industry. IDT has considered various 
techniquesto protect the cells from this hazardous occurrence. 
These techniques include dual-well structures (Figures 3 and 
4) and a polymeric compound for die coating. Presently, a 
polymeric compound is used in many of IDTs SRAMs; however, 
the specific techniques used may vary and change from one 
device generation to the next as the industry and IDT improve 
the alpha particle protection technology. 

LATCHUP IMMUNITY 

A combination of careful design layout, selective use of 
guard rings and proprietary techniques have resulted in virtual 
elimination of latchup problems often associated with older 
CMOS processes (Figure 5). The use of NPN and N-channel 
I/O devices eliminates hole injection latchup. Double guard 
ring structures are utilized on all input and output circuits to 
absorb injected electrons. These effectively cut off the current 
paths into the internal circuits to essentially isolate I/O circuits. 
Compared to older CMOS processes which exhibit latchup 
characteristics with trigger currents from 10-20mA, IDT 
products inhibit latchup at triggercurrents substantially greater 
than this. 



(a) Section A-A 01234567 

(b) Collector Supply Voltage Vfcc (V) 
Typical 
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Figure 5. IDT CEMOS Latchup Suppression 
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SURFACE MOUNT TECHNOLOGY 
AND 

IDT’S MODULE PRODUCTS 


Requirements for circuit area reduction, utilizing the most 
efficient and compact component placement possible and the 
needs of production manufacturing for electronics assemblies 
are the driving forces behind the advancement of circuit-board 
assembly technologies. These needs are closely associated 
with the advances being made in surface mount devices 
(SMD) and surface mount technology (SMT) itself. Yet, there 
are two major issues with SMT in production manufacturing of 
electronic assemblies: high capital expenditures and com- 
plexity of testing. 

The capital expenditure required to convert to efficient 
production using SMT is still too high for the majority of 
electronics companies, regardless of the 20-60% increase in 
the board densities which SMT can bring. Because of this high 
barrier to entry, we will continue to see a large market segment 
[large even compared to the exploding SMT market] using 
traditional through-hole packages (i.e. DIPs, PGAs, etc) and 
assembly techniques. How can these types of companies 
take advantage of SMD and SMT? Let someone else, such 
as IDT, do it for them by investing time and money in SMT and 
then in return offer through-hole products utilizing SMT proc- 
esses. Products which fit this description are multi-chip 
modules, consisting of SMT assembled SMDs on a through- 
hole type substrate. Modules enable companies to enjoy SMT 
density advantages and traditional package options without 
the expensive startup costs required to do SMT in-house. 

Although subcontracting this type of work to an assembly 
house is an alternative, there still is the other issue of testing, 
an area where many contract assembly operations fall short 
of IDT's capability and experience. Prerequisites for ade- 
quate module testing sophisticated high performance par- 
ametric testers, customized test fixtures, and most impor- 
tantly the experience to tests today’s complex electronic 
devices. Companies can therefore take advantage of IDT’s 
experience in testing and manufacturing high performance 
CMOS multi-chip modules. 

At IDT, SMD components are electrically tested, environ- 
mentally screened, and performance selected for each IDT 
module. All modules are 1 00% tested as if they are a separate 
functional component and are guaranteed to meet all speci- 
fied parameters at the module output without the customer 
having to understand the modules’ internal workings. 


Other added benefits companies get by using IDT’s CMOS 
module products are: 

1 ) a wide variety of high performance, through-hole prod- 
ucts utilizing SMD packaged components, 

2) fast speeds compared with NMOS based products, 

3) low power consumption compared with bipolar tech- 
nologies, and 

4) low cost manufacturability compared with GaAs based 
products. 

IDT has recognized the problems of SMT and began 
offering CMOS modules as part of its standard product port- 
folio. IDT modules combine the advantages of: 

1) the low power characteristics of IDT’s CEMOS™ and 
BiCEMOS™ products, 

2) the density advantages of first class SMD components 
including those from IDT’s components divisions, and 

3) experience in system level design, manufacturing, and 
testing with its own in-house SMT operation. 

IDT currently has two divisions (Subsystems and RISC 
Subsystems) dedicated to the development of module prod- 
ucts ranging from simple memory modules to complex VME 
sized application specific modules to full system level CPU 
boards. These modules have surface mount devices as- 
sembled on both sides of either a multi-layer glass filled epoxy 
(FR-4) ora multi-layer co-fired ceramic substrate. Assembled 
modules come available in industry standard through-hole 
packages and other space-saving module packages. Indus- 
try proven vapor-phase or IR reflow techniques are used to 
solder the SMDs to the substrate during the assembly proc- 
ess. Because of our affiliation with IDT’s experienced semi- 
conductor manufacturing divisions, we thoroughly under- 
stand and therefore test all modules to the applicable da- 
tasheet specifications and customer requirements. 

Thus, IDT is able to offer today’s electronic design engi- 
neers a unique solution for their “need-more-for-less” 
problem.modules. These high speed, high performance 
products offer the density advantages of SMD and SMT, the 
added benefit of low power CMOS technology, and through- 
hole packaged electronics without the high cost of doing it in- 
house. 
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STATE-OF-THE-ART FACILITIES AND CAPABILITIES 


Integrated Device Technology is headquartered in Santa 
Clara, California — the heart of the “Silicon Valley.” The 
company’s operations are housed in seven facilities totaling 
over 500,000 square feet. These facilities house all aspects 
of business from research and development to design, wafer 
fabrication, assembly, environmental screening, test and 
administration. In-house capabilities include scanning electron 
microscope (SEM) evaluation, particle impact noise detection 
(PIND), plastic and hermetic packaging, military and 
commercial testing, burn-in, life test and a full complement of 
environmental screening equipment. 

The over-200, 000-square-foot corporate headquarters 
campus is composed of four buildings. The largest facility on 
this site is a 100,000 square foot, two-building complex. The 
first building, a 60,000 square foot facility, is dedicated to the 
Complex Logic, Standard Logic and RISC Microprocessor 
product lines, as well as hermetic and plastic package 
assembly, logic products’ test, burn-in, mark and QA, and a 
reliability/failure analysis lab. 

IDT’s Packaging and Assembly Process Development 
teams are located here. To keep pace with the development 
of new products and to enhance the IDT philosophy of 
“Innovation,” these teams have ultra modern, integrated and 
correspondingly sophisticated equipment and environments 
at their disposal. All manufacturing is completed in dedicated 
clean room areas (Class 10K minimum), with all preseal 
operations accomplished underClass 1 00 laminarf low hoods. 

Development of assembly materials, processes and 
equipment is accomplished undera fully operational production 
environment to ensure reliability and repeatable product. The 
Hermetic Manufacturing and Process Development team is 
currently producing customproductsto the strict requirements 
of M I L-STD-883. The fully automated plastic facility is currently 
producing high volumes of USA-manufactured product, while 
developing state-of-the-art surface mount technology patterned 
after MIL-STD-883. 

The second building of the complex houses sales, marketing, 
finance and MIS. 

The RISC Subsystems and Subsystems Modules Divisions 
are located behind the two-building complex in a 54,000 
square foot facility. Also located at this facility are Quality 
Assurance and wafer fabrication services. 

Directly across the street from the two-building complex is 
a newly acquired 50,000 square foot facility that houses 


administrative services, Northwest Area Sales, Human 
Resources, International Planning and Shipping and Receiving 
functions. 

IDT's largest and newest facility, opened in 1990 in San 
Jose, California, is a multi-purpose 1 50,000 square foot, ultra 
modern technology development center. This facility houses 
a 25,000 square foot, combined Class 1 (a maximum of one 
particle percubic foot of 0.2 micron or larger), sub-half-micron 
R&D fabrication facility and a wafer fabrication area. This fab 
supports both production volumes of IDT products, including 
some next generation SRAMs, and the R&D efforts of the 
technology development staff. T echnology development efforts 
targeted for the center include advanced silicon processing 
and wafer fabrication techniques. A test area to support both 
production and research is located on-site. The building is 
also the new home of the FIFO and ECL product lines. 

IDT’s second largest facility is located in Salinas, California, 
about an hour away from Santa Clara. This 95,000 square 
foot facility, located on 14 acres, is the Static RAM Division 
and Specialty Memory product line. Constructed in 1 985, this 
facility houses an ultra-modern 25,000 square foot high- 
volume wafer fabrication area measured at Class 2-to-3 (a 
maximum of 2 to 3 particles per cubic foot of 0.2 micron or 
larger) clean room conditions. Careful design and construction 
of this fabrication area created a clean room environment far 
beyond the 1985 average for U.S. fab areas. This made 
possible the production of large volumes of high-density 
submicron geometry, fast static RAMs. This facility also 
houses shipping areas for IDT’s leadership family of CMOS 
static RAMs. This site will expand to accommodate a 250,000 
square foot complex. 

To extend these philosophies while maintaining strict control 
of our processes, IDT has an operational Assembly and Test 
facility located in Penang, Malaysia. This facility assembles 
product to USA standards, with all assemblies done under 
laminar flow conditions (Class 1 00) until the silicon is encased 
in its final packaging. All products in this facility are 
manufactured to the quality control requirements of MIL-STD- 
883. 

All of IDT’s facilities are aimed at increasing our 
manufacturing productivity to supply ever larger volumes of 
high-performance, cost-effective leadership CMOS products. 



SUPERIOR QUALITY AND RELIABILITY 


Maintaining the highest standards of quality in the industry 
on all products is the basis of Integrated Device Technology’s 
manufacturing systems and procedures. From inception, 
quality and reliability are built into all of IDT’s products. Quality 
is “designed in” at every stage of manufacturing - as opposed 
to being “tested-in” later - in order to ensure impeccable 
performance. 

Dedicated commitment to fine workmanship, along with 
development of rigid controls throughout wafer fab, device 
assembly and electrical test, create inherently reliable products. 
Incoming materials are subjected to careful inspect ions. Quality 
monitors, or inspections, are performed throughout the 
manufacturing flow. 

IDT military grade monolithic hermetic products are designed 
to meet or exceed the demanding Class B reliability levels of 
MIL-STD-883 and MIL-M-38510, as defined by Paragraph 
1.2.1 of MIL-STD-883. 

Product flow and test procedures for all monolithic hermetic 
military grade products are in accordance with the latest 
revision and notice of MIL-STD-883. State-of-the-art production 
techniques and computer-based test procedures are coupled 
with tight controls and inspections to ensure that products 
meet the requirements for 100% screening. Routine quality 
conformance lot testing is performed as defined in MIL-STD- 
883, Methods 5004 and 5005. 

For IDT module products, screening of the fully assembled 
substrates is performed, in addition to the monolithic level 
screening, to assure package integrity and mechanical 


reliability. All modules receive 100% electrical tests (DC, 
functional and dynamic switching) to ensure compliance with 
the "subsystem" specifications. 

By maintaining these high standards and rigid controls 
throughout every step of the manufacturing process, IDT 
ensuresthat commercial, industrial and military grade products 
consistently meet customerrequirementsforquality, reliability 
and performance. 

SPECIAL PROGRAMS 

Class S. IDT also has all manufacturing, screening and 
test capabilities in-house (except X-ray and some Group D 
tests) to perform complete Class S processing per MIL-STD- 
883 on all IDT products and has supplied Class S products on 
several programs. 

Radiation Hardened. IDT has developed and supplied 
several levels of radiation hardened products for military/ 
aerospace applications to perform at various levels of dose 
rate, total dose, single event upset (SEU), upset and latchup. 
IDT products maintain nearly their same high-performance 
levels built to these special process requirements. The 
company has in-house radiation testing capability used both 
in process development and testing of deliverable product. 
IDT also has a separate group within the company dedicated 
to supplying products for radiation hardened applications and 
to continue research and development of process and products 
to further improve radiation hardening capabilities. 
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QSP-QUALITY, SERVICE AND PERFORMANCE 


Quality from the beginning, is the foundation for IDT’s 
commitment to supply consistently high-quality products to 
our customers. IDT’s quality commitment is embodied in its all 
pervasive Constant Quality Improvement (CQI) program. 
Everyone who influences the quality of the product-from the 
designer to the shipping clerk-is committed to constantly 
improving the product quality. 


These systems and controls concentrate on CQI by focusing 
on the following key elements: 

Statistical Techniques 

Using statisticaltechniques, including Statistical Process 
Control (SPC) to determine whether the product/ 
processes are under control. 


IDT'S FOCUS 

“To make quantitative constant improvement in the quality 
of our actions that result in the supply of leadership products 
in conformance to the requirements of our customers.” 

IDT has dedicated its efforts to constant quantitative 
improvements in quality. The result, a supply of leadership 
products that conform to the requirements of our customers. 


IDT’S PRODUCT ASSURANCE STRATEGY 
FOR CQI 

Measurable standards are essential to the success of CQI. 
All the processes contributing to the final quality of the product 
need to be monitored, measured and improved upon through 
the use of statistical tools. 


DEVELOPMENT 

I 

FAB 

I 

ASSEMBLY 

I 

TEST 

I 

SHIP 


PRODUCT FLOW 


Standardization 

Implementing policies, procedures and measurement 
techniques that are common across different operational 
areas. 

Documentation 

Documenting and training in policies, procedures, 
measurement techniques and updating through 
characterization/ capability studies. 

Productivity Improvement 

Using constant improvement teams made up from 
employees at all levels of the organization. 

Leadership 

Focusing on quality as a key business parameter and 
strategic strength. 

Total Employee Participation 

Incorporating the CQI program into the IDT Corporate 
Culture. 

Customer Service 

Supporting the customer, as a partner, through 
performance review and pro-active problem solving. 


Ourcustomers receive the benefit of ouroptimized systems. 
Installed to enhance quality and reliability, these systems 
provide accurate and timely reporting on the effectiveness of 
manufacturing controls and the reliability and quality 
performance of IDT products and services. 


SERVICE FLOW 


ORDER ENTRY 

I 

PRODUCTION CONTROL 

I 

SHIPPING 

I 

CUSTOMER SUPPORT 


People Excellence 

Committing to growing, motivating and retaining people 
throughtraining, goal setting, performance measurement 
and review. 

PRODUCT FLOW 

Product quality starts here. IDT has mechanisms and 
procedures in place that monitor and control the quality of our 
development activities. Fromthe calibration of designcapture 
libraries through process technology and product 
characterization that establish whether the performance, 
ratings and reliability criteria have been met. This includes 
failure analysis of parts that will improve the prototype product. 

At the pre-production stage once again in-house qualification 
tests assure the quality and reliability of the product. All 
specifications and manufacturing flows are established and 
personnel trained before the product is placed into production. 



Manufacturing 

To make CQI during the manufacturing stage, control items 
are determined for major manufacturing conditions. Data is 
gathered and statistical techniques are used to control specific 
manufacturing processes that affect the quality of the product. 

In-process and final inspections are fed back to earlier 
processes to improve product quality. All product is burned- 
in (where applicable) before 100% inspection of electrical 
characteristics takes place. 

Products which pass final inspection are then subject to 
Quality Assurance and Reliability Tests. This data is used to 
improve manufacturing processes and provide reliability 
predictions of field applications. 

Inventory and Shipping 

Controls in shipping focus on ensuring parts are identified 
and packaged correctly. Care is also taken to see that the 
correct paperwork is present and the product being shipped 
was processed correctly. 

SERVICE FLOW 

Quality not only applies to the product but to the quality -of 
-service we give our customers. Service is also constantly 
improved. 

Order Procedures 

Checks are made at the order entry stage to ensure the 
correct processing of the Customer’s product. After verification 
and data entry the Acknowledgements (sent to Customers) 
are again checked to ensure details are correct. As part of the 
CQI program, the results of these verifications are analyzed 
using statistical techniques and corrective actions are taken. 

Production Control 

Production Control (P.C.) is responsible for the flow and 
logistics of material as it moves through the manufacturing 
processes. The quality of the actions taken by P.C. greatly 
impinges on the quality of service the customer receives. 
Because many of our customers have implemented Just-in- 
Time (JIT) manufacturing practices, IDT as a supplier also has 


to adopt these same disciplines. As a result, employees 
receive extensive training and the performance level of key 
actions are kept under constant review. These key actions 
include: 

Quotation response and accuracy. 

Scheduling response and accuracy. 

Response and accuracy of Expedites. 

Inventory, management, and effectiveness. 

On time delivery. 

Customer Support 

I DT has a worldwide network of sales offices and Technical 
Development Centers. These provide local customer support 
on business transactions, and in addition, support customers 
on applications information, technical services, benchmarking 
of hardware solutions, and demonstration of various 
Development Workstations. 

The key to CQI is the timely resolution of defects and 
implementation of the corrective actions. This is no more 
important than when product failures are found by a 
customer. When failures are found at the customer’s incoming 
inspection, in the production line, or the field application, the 
Quality Assurance group is the focal point for the investigation 
of the cause of failure and implementation of the corrective 
action. IDT constantly improves the level of support we give 
our customers by monitoring the response time to customers 
that have detected a product failure. Providing the customer 
with an analysis of the failure, including corrective actions and 
the statistical analysis of defects, brings CQI full circle-full 
support of our customers and their designs with high-quality 
products. 

SUMMARY 

In 1990, IDT made the commitment to “Leadership through 
Quality, Service, and Performance Products". 

We believe by following that credo IDT and ourcusotmers 
will be successful in the coming decade. With the 
implementation of the CQI strategy, we will satisfy our goal... 

"Leadership through Quality, Service and Performance 

Products". 
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IDT QUALITY CONFORMANCE PROGRAM 


A COMMITMENT TO QUALITY 

Integrated Device Technology’s monolithic and modular 
assembly products are designed, manufactured and tested in 
accordance with the strict controls and procedures required 
by Military Standards. The documentation, design and 
manufacturing criteria of the Quality and Reliability Assurance 
Program were developed and are being maintained to the 
most current revisions of MIL-3851 0 as defined by paragraph 
1.2.1 of MIL-STD-883 and MIL-STD-883 requirements. 

Product flow and test procedures for all Class B monolithic 
hermetic Military Grade microcircuits are in full compliance 
with paragraph 1.2.1 of MIL-STD-883. State-of-the-art 
production techniques and computer-based test procedures 
are coupled with stringent controls and inspections to ensure 
that products meet the requirements for 1 00% screening and 
quality conformance tests as defined in MIL-STD-883, Methods 
5004 and 5005. 

Product flow and test procedures for all plastic and 
commercial hermeticproducls are in accordance with industry 
practices for producing highly reliable microcircuits to ensure 
that products meet the IDT requirements for 1 00% screening 
and quality conformance tests. 

By maintaining these high standards and rigid controls 
throughout every step of the manufacturing process, IDT 
ensures that our products consistently meet customer 
requirements for quality, reliability and performance. 

SUMMARY 

Monolithic Hermetic Package Processing Flow 0 ’ 

Refer to the Monolithic Hermetic Package Processing Flow 
diagram. All test methods refer to MIL-STD-883 unless 
otherwise stated. 

1. Wafer Fabrication: Humidity, temperature and 
particulate contamination levels are controlled and 
maintained accordingtocriteriapattemed after Federal 
Standard 209, Clean Room and Workstation 
Requirements. All critical workstations are maintained 
at Class 100 levels or better. 

Wafers from each waferfabrication area are subjected 
to Scanning Electron Microscope analysis on a periodic 
basis. 

2. Die-Sort Visual Inspection: Wafers are cut and 
separated and the individual die are 100% visually 
inspected to strict IDT-defined internal criteria. 

3. Die Shear Monitor: To ensure die attach integrity, 
product samples are routinely subjected to a shear 
strength test per Method 2019. 


4. Wire Bond Monitor: Product samples are routinely 
subjected toastrengthtestperMethod 2011, Condition 
D, to ensure the integrity of the lead bond process. 

5. Pre-Cap Visual: Before the completed package is 
sealed, 100% of the product is visually inspected to 
Method 2010, Condition B criteria. 


6 . 


Environmental Conditioning: 100% of the sealed 
product is subjected to environmental stress tests. 
These thermal and mechanical tests are designed to 
eliminate units with marginal seal, die attach or lead 
bond integrity. 



7. Hermetic Testing : 1 00% of the hermetic packages 

are subjected to fine and gross leak seal tests to 
eliminate marginally sealed units or units whose 
seals may have become defective as a result of 
environmental conditioning tests. 


8. Pre-Burn-In Electrical Test: Each product is 1 00% 
electrically tested at an ambienttemperature of +25°C 
to IDT data sheet or the customer specification. 


9. Burn-In: 100% of the Military Grade product is 
burned-in under dynamic electrical conditions to the 
time and temperature requirements of Method 1015, 
Condition D. Except for the time, Commercial Grade 
product is burned-in as applicable to the same 
conditions as Military Grade devices. 


10. Post-Burn-In Electrical: After burn-in, 1 00% of the 
Class B Military Grade product is electrically tested to 
IDT data sheet or customer specifications over the 
-55°C to +1 25°C temperature range. Commercial 
Grade products are sample tested to the applicable 
temperature extremes. 


1 1 . Mark: All product is marked with product type and lot 
code identifiers. MIL-STD-883 compliant Military 
Grade products are identified with the required 
compliant code letter. 

12. Quality Conformance Tests: Samples of the Military 
Grade product which have been processed to the 
100% screening tests of Method 5004 are routinely 
subjected to the quality conformance requirements of 
Method 5005. 


NOTE: 

1 . For quality requirements beyond Class B levels such as SEM analysis, X-Ray inspection, Particle Impact Noise Reduction (PIND) test, Class S screening 
or other customer specified screening flows, please contact your Integrated Device Technology sales representative. 


3.2 


1 



Monolithic Hermetic Package Processing Flow 

MIL-STD-883 

TEST METHOD AND CONDITIONS 
WAFER FAB I IDT SPEC 


(SEE NOTE 1) 


INCOMING 
PREFORM 
(SAMPLE) \2J — * 
INCOMING 
PACKAGE (QJ — •> 
(SAMPLE) 


J WAFER PREP FOR ASSEMBLY 
1 DIE-SORT VISUAL INSPECTION 


4— (q) die optical 

^ INSPECTION sample 


DIE ATTACH 


INCOMING 
WIRE (SAMPLE) 


<2H 


W— (o) die attach 

1 PUSH TEST 

SAMPLE 


WIRE BOND 


M O) LEAD BOND PULL 
TEST SAMPLE 


PRE-CAP VISUAL 


INCOMING 
LIDS (SAMPLE) 




*—(q) pre-cap visual 

W SAMPLE 


SEAL 

BACK MARK 
TEMP CYCLE 
CENTRIFUGE 

FINE LEAK TEST 
GROSS LEAK TEST 


FORM 


AS 


REQUIR ED 
> 1 I TRIM 


ATTACH 

BUMPER 


AS 

REQUIRED 




HERMETICITY SAMPLE: 
(SEE NOTE 3) 


] EXTERNAL VISUAL 


PRE BURN-IN 
ELECTRICAL TEST 


2019 

1 

(>2.5kg) 

1 

IDT SPEC 

1 

1 

2011 

1 

(>3.0grams) 

1 

2010 

COND. B 

1 

2010 

COND. B 


IDT SPEC 

I 

IDT SPEC PROVIDES LOT 


TRACEABILITY 


1010 COND. C, 10 cycles 
I -65°C TO +150°C 
2001 COND. E, Y1 Direction 
>30kg (PKG < 5g) 

>20kg (PKG > 5g) 

1014 COND. A or B, 

I <5.0 X 10"® ATM/CC/SEC. 
1014 COND. C 


IDT SPEC 


FINE LEAK: GROSS LEAK: 

COND. A or B COND. C 
1014 <5.0 X 10 -8 
I ATM/CC/SEC. 

IDT SPEC 


5004 DC, AC, FUNCTIONAL @ +25°C 
| (SEE NOTE 2) 


SEE FINAL PROCESSING FLOW FOR REMAINDER OF OPERATIONS AND NOTES 
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SUMMARY 

Monolithic Plastic Package Processing Flow 

Refer to the Monolithic Plastic Package Processing Flow 
diagram. All test methods refer to MIL-STD-883 unless 
otherwise stated. 

1. Wafer Fabrication: Humidity, temperature and 
particulate contamination levels are controlled and 
maintained according to criteria patterned after Federal 
Standard 209, Clean Room and Workstation 
Requirements. Allcritical workstations are maintained 
at Class 100 levels or better. 


6. Post Mold Cure: Plastic encapsulated devices are 
baked to ensure an optimum plastic seal so as to 
enhance moisture barrier characteristics. 

7. Pre-Burn-In Electrical: Each product is 100% 
electrically tested at an ambienttemperature of +25°C 
to IDT data sheet or the customer specification. 

8. Burn-In: Except for MSI Logic family devices where 
it may be obtained as an option, all Commercial 
Grade plastic package products are burned-in 16 
hours at +125°C (or equivalent), utilizing the same 
burn-in conditions as the Military Grade product. 


Topside silicon nitride passivation is all applied to all 
wafers for better moisture barrier characteristics. 

Wafersfromeachwaferfabrication area are subjected 
to Scanning Electron Microscope analysis on a periodic 
basis. 


9. 


Post-Burn-In Electrical: After burn-in, 100% of the 
plastic product is electrically tested to IDT data sheet 
or customer specifications at the maximum 
temperature extreme. The minimum temperature 
extreme is tested periodically on an audit basis. 



2. Die-Sort Visual Inspection: Wafers are 100% 
visually inspected to strict IDT defined internal 
criteria. 

3. Die Push Test: To ensure die attach integrity, 
product samples are routinely subjected to die push 
tests. 

4. Wire Bond Monitor: Product samples are routinely 
subjected to wire bond pull tests to ensure the integrity 
of the lead bond process. 


10. Mark: All product is marked with product type and lot 
code identifiers. 

11. Quality Conformance Inspection: Samples of the 
plastic product which have been processed to the 
100% screening requirements are subjected to the 
Periodic Quality Conformance Inspection Program. 
Where indicated, the test methods are patterned after 
MIL-STD-883 criteria. 


5. Pre-Cap Visual: Before the package is molded, 
100% of the product is visually inspected to criteria 
patterned after MIL-STD-883, Method 2010, Condition 
B. 
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Monolithic Plastic Package Processing Flow 


WAFER FABRICATION (SEE NOTE 1) 




INCOMING LEAD FRAME VIS/DIIW ^ * © OPTICAL INSPECTION SAMPLE 

SAMPLE (Q) ► 


INCOMING DIE ATTACH EPOXY (o)— J 


SAMPLE 


WAFER PREP FOR ASSEMBLY 

DIE-SORT VISUAL 

DIE OPTICAL INSPECTION 


FRAME LOAD 


3 EPOXY DIE ATTACH AND CURE 


INCOMING GOLD WIRE ^ * © DIE ATTACH PUSH TEST SAMPLE 

SAMPLE (0) ► 

THERMOSONIC WIRE BOND 

LEAD BOND PULL TEST SAMPLE 


r-® 


PRE-CAP OPTICAL INSPECTION 


INCOMING MOLDING COMPOUND + © PRE-CAP OPTICAL SAMPLE INSPECTION 

SAMPLE (Q ) ► 

I I ENCAPSULATION/MOLD 


3 CHEMICAL DEFUSH 


BURN-IN BIASED/DYNAMIC AT +125"C, 
160 HRS. MAXIMUM TO 16 HRS. 
MINIMUM (OR EQUIVALENT) ON ALL 
PRODUCTS EXCEPT MSI LOGIC 
FAMILY DEVICES (FCT, FCT-T and FBT),| 
ON WHICH IT MAY BE OBTAINED AS 
AN OPTION. 


J POST MOLD CURE 
3 MECHANICAL DEFUSH AND TRIM 
] SOLDER PUTE 
3 LEAD FORM/SINGULATION 
3 OPEN/SHORT TEST SAMPLE 
3 EXTERNAL VISUAL 

3 PRE-BURN-IN ELECTRICAL TEST +25°C (SEE NOTE 2) 
3 BURN-IN 


POST BURN-IN ELECTRICAL TEST +70°C (SEE NOTE 2) 

PDA | £10% 

H°> ELECTRICAL TEST QUALITY SAMPLE +70°C (SEE NOTE 2) 
] TOPSIDE MARK 


3 LEAD STRAIGHTEN 


] 100% ELECTRICAL TEST +25°C (SEE NOTE 2) 
PDA | £5% (SEE NOTE 2) 

SHIPPING INSPECTION GATE 


NOTE: 

1. All screens are 100% unless otherwise noted. 

2. All electrical test programs are per the applicable IDT test specification. 

3. IDT performs a 100% electrical test at +25°C with a 5% PDA limit at this point. 

4. ( 5 ) = Quality sample inspection. 
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Monolithic Hermetic Package Final Processing Flow 


Operation 

MIL-STD-883 

Test Method 

Military 
Compliant 
Class B 

Burn-In 

1015/D at +125°C 

Min. or Equivalent 

100% 

160 Hours 

Post Burn-In Electrical: Static (DC), Functional and 
Switching (AC) (2) 

IDT Spec. 

100% 

+25, -55 and 
+125°C 

Percent Defective Allowed (PDA)' 4 ’ 

5004 or IDT Spec. 

5% 

Group A Electrical: Static (DC), Functional and 
Switching (AC) (2) 

5005 and IDT Spec. 

Sample 

-55 and +125°C 

Mark/Lead Straighten 

IDT Spec. 

100% 

+25°C Electrical' 2 ) 

IDT Spec. 

100%< s > 

Final Visual/Pack 

IDT Spec. 

100% 

Quality Conformance Inspection 

5005 (Group B, C, D) 

Yes 

Quality Shipping Inspection 
(Visual/Plant Clearance) 

IDT Spec. 

Sample 


Commercial 

Military 

Commercial 

Temp. Range 

Temp. Range 

100% 

100% 

16 to 160 Hours 

16to160 Hours 

1 00% 

100% 

+125°C 

+70°C 

10% 

10% 

Sample 

Sample 

+125°C 

+70°C 

100% 

100% 

100% 

100% 

100% 

100% 

— 

— 

Sample 

Sample 




NOTES: 

1 . All screens are 100% unless otherwise noted. 

2. All electrical test programs are per the applicable IDT test specification. 

3. This hermeticity sample is performed after all lead finish operations. 

4. If a lot fails the 5% PDA but is SI 0%, the lot may be resubmitted to burn-in one time only to the same time and temperature conditions as first submission. 
The subsequent post bum-in electrical test at +25°C will be performed to a PDA of 3%. 

5. IDT performs a 100% electrical test at +25°C with a 2% PDA limit at this point to satisfy group A requirements, and considers this to be equivalent to the 
group A requirement of an LTPD of 2, with an accept number of 0. If a lot fails the 2% PDA limit, it may be rescreened one time only to a tightened PDA 
limit of 1.5%. 

6. (O) = Quality sample inspection. 
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RADIATION TOLERANT/ENHANCED/HARDENED PRODUCTS FOR 
RADIATION ENVIRONMENTS 


INTRODUCTION 

The need for high-performance CMOS integrated circuits 
in military and space systems is more critical today than ever 
before. The low power dissipation that is achieved using 
CMOS technology, along with the high complexity and density 
levels, makes CMOS the nearly ideal component for all types 
of applications. 

Systems designed for military or space applications are 
intended for environments where high levels of radiation may 
be encountered. The implication of a device failure within a 
military or space system clearly is critical. IDT has made a 
significant contribution toward providing reliable radiation- 
tolerant systems by offering integrated circuits with enhanced 
radiation tolerance. Radiation environments, IDT process 
enhancements and device tolerance levels achieved are 
described below. 

THE RADIATION ENVIRONMENT 

There are four different types of radiation environments 
that are of concern to builders of military and space systems. 
These environments and their effects on the device operation, 
summarized in Figure 1, are as follows: 

Total Dose Accumulation refers to the total amount of 
accumulated gamma rays experienced by the devices in the 
system, and is measured in RADS (SI) for radiation units 
experienced at the silicon level. The physical effect of gamma 
rays on semiconductor devices is to cause threshold shifts ( Vt 
shifts) of both the active transistors as well as the parasitic field 
transistors. Threshold voltages decrease as total dose is 
accumulated; at some point, the device will begin to exhibit 
parametric failures as the input/output and supply currents 
increase. At higher radiation accumulation levels, functional 
failures occur. In memory circuits, however, functional failures 
due to memory cell failure often occur first. 

Burst Radiation or Dose Rate refers to the amount of 
radiation, usually photons or electrons, experienced by the 
devices in the system due to a pulse event, and is measured 
in RADS (SI) per second. The effect of a high dose rate or 
burst of radiation on CMOS integrated circuits is to cause 
temporary upset of logic states and/or CMOS latch-up. Latch- 
up can cause permanent damage to the device. 

Single Event Upset (SEU) is a transient logic state change 
caused by high-energy ions, such as energetic cosmic rays, 
striking the integrated circuits. As the ion passes through the 
silicon, charge is either created through ionization or direct 
nuclear collision. If collected by a circuit node, this excess 
charge can cause a change in logic state of the circuit. 
Dynamic nodes that are not actively held at a particular logic 
state (dynamic RAM cellsforexample) are the most susceptible. 
These upsets are transient, but can cause system failures 
known as “soft errors.” 


Neutron Irradiation will cause structural damage to the 
silicon lattice which may lead to device leakage and, ultimately, 
functional failure. 


Radiation 

Category 

Primary 

Particle 

Source 

Effect 

Total Dose 

Gamma 

Space or 

Nuclear 

Event 

Permanent 

Dose Rate 

Photons 

Nuclear 

Event 

Temporary 

Upset of Logic 
State or 

Latch-up 

SEU 

Cosmic 

Rays 

Space 

Temporary 

Upset of 

Logic State 

Neutron 

Neutrons 

Nuclear 

Event 

Device Leakage 
Due to Silicon 
Lattice Damage 
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Figure 1. 


DEVICE ENHANCEMENTS 

Of the four radiation environments above, IDT has taken 
considerable data on the first two, Total Dose Accumulation 
and Dose Rate. I DT has developed a process that significantly 
improves the radiation tolerance of its devices within these 
environments. Prevention of SEU failures is usually 
accomplished by system-level considerations, such as Error 
Detection and Correction (EDC) circuitry, since the occurrence 
of SEUs is not particularly dependent on process technology. 
Through IDT’s customer contracts, SEU has been gathered 
on some devices. Little is yet known about the effects of 
neutron-induced damage. For more information on SEU 
testing, contact IDT’s Radiation Hardened Product Group. 

Enhancements to IDT’s standard process are used to 
create radiation enhanced and tolerant processes. Field and 
gate oxides are “hardened” to make the device less susceptible 
to radiation damage by modifying the process architecture to 
allow lowertemperature processing. Device implants and Vts 
adjustments allow more Vt margin. In addition to process 
changes, IDT’s radiation enhanced process utilizes epitaxial 
substrate material. The use of epi substrate material provides 
a lower substrate resistance environment to create latch-up 
free CMOS structures. 
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RADIATION HARDNESS CATEGORIES 

Radiation Enhanced (‘RE) or Radiation Tolerant (‘RT) 
versions of IDT products follow IDT’s military product data 
sheets whenever possible (consult factory) . IDT’s Total Dose 
Test plan exposes a sample of die on a wafer to a particular 
Total Dose level via ARACOR X-Ray radiation. This Total 
Dose Test plan qualifies each ‘RE or ‘RT waferto a Total Dose 
level. Only wafers with sampled die that pass Total Dose level 
tests are assembled and used for orders (consult factory for 
more details on Total Dose sample testing). With regard to 
Total Dose testing, clarifications/exceptions to MIL-STD-883, 
Methods 5005 and 1019 are required. Consult factory for 
more details. 

The ‘RE and ‘RT process enhancements enable IDT to 
offer integrated circuits with varying grades of radiation 
tolerance or radiation “hardness”. 

• Radiation Enhanced process uses Epi wafers and is able 
to provide devices that can be Total Dose qualified to 10K 
RADs (Si) or greater by IDT’s ARACOR X-Ray Total Dose 
sample die test plan (Total Dose levels require negotiation, 
consult factory for more details). 

• Radiation Tolerant product uses standard wafer/process 
material that is qualified to 10K RADs (Si) Total Dose by 
IDT’s ARACOR X-Ray Total Dose sample die test plan. 


Integrated Device Technology can provide Radiation 
Tolerant/Enhanced versions of all product types (some speed 
grades may not be available as ‘RE). 

Please contact your IDT sales representative or factory 
marketing to determine availability and price of any IDT 
product processed in accordance with one of these levels of 
radiation hardness. 


CONCLUSION 

There has been widespread interest within the military and 
space community in IDT’s CMOS product line for its radiation 
hardness levels, as well as its high-performance and low 
power dissipation. To serve this growing need for CMOS 
circuits that must operate in a radiation environment, IDT has 
created a separate group within the company to concentrate 
on supplying products for these applications.Continuing 
research and development of process and products, including 
the use of in-house radiation testing capability, will allow 
Integrated Device Technology to offercontinuously increasing 
levels of radiation-tolerant solutions. 
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THERMAL PERFORMANCE CALCULATIONS FOR IDT’S PACKAGES 


Since most of the electrical energy consumed by 
microelectronic devices eventually appears as heat, poor 
thermal performance of the device or lack of management of 
this thermal energy can cause a variety of deleterious effects. 
This device temperature increase can exhibit itself as one of 
the key variables in establishing device performance and long 
term reliability; on the other hand, effective dissipation of 
internally generated thermal energy can, if properly managed, 
reduce the deleterious effects and improve component 
reliability. 

A few key benefits of IDT’s enhanced CEMOS™ process 
are: low power dissipation, high speed, increased levels of 
integration, wider operating temperature ranges and lower 
quiescent power dissipation. Because the reliability of an 
integrated circuit is largely dependent on the maximum 
temperature the device attains during operation, and as the 
junction stability declines with increases in junction temperature 
(Tj), it becomes increasingly important to maintain a low (Tj). 

CMOS devices stabilize more quickly and at greatly lower 
temperature than bipolar devices under normal operation. 
The accelerated aging of an integrated circuit can be expressed 
as an exponential function of the junction temperature as: 



where 


tA = lifetime at elevated junction (Tj) temperature 
to = normal lifetime at normal junction (To) temperature 
Ea = activation energy (ev) 
k = Boltzmann’s constant (8.617 x 10~ 5 ev/k) 

i.e. the lifetime of a device could be decreased by a factor of 
2 for every 1 0°C increase temperature. 

To minimize the deleterious effects associated with this 
potential increase, IDT has: 

1. Optimized our proprietary low-power CEMOS 
fabrication process to ensure the active junction 
temperature rise is minimal. 

2. Selected only packaging materials that optimize heat 
dissipation, which encourages acoolermnning device. 

3. Physically designed all package components to 
enhance the inherent material properties and to take 
full advantage of heat transfer and radiation due to 
case geometries. 


4. Tightly controlled the assembly procedures to meet or 
exceed the stringent criteria of MIL-STD-883_to ensure 
maximum heat transfer between die and packaging 
materials. 

The following figures graphically illustrate the thermal values 
of IDT’s current package families. Each envelop (shaded 
area) depicts a typical spread of values due to the influence of 
a number of factors which include: circuit size, package cavity 
size and die attach integrity. The following range of values are 
to be used as a comprehensive characterization of the major 
variables rather than single point of reference. 

When calculating junction temperature (Tj), it is necessary 
to know the thermal resistance of the package (0ja) as 
measured in "degree Celsius per watt”. With the accompanying 
data, the following equation can be used to establish thermal 
performance, enhancedevice reliability and ultimately provide 
you, the user, with a continuing series of high-speed, low- 
power CMOS solutions to your system design needs. 

Oja = [Tj - Ta]/P 

Tj = Ta + P[0JA] = Ta + P[0JC + 0CA] 
where 

0jc = Tj - Tc 0 ca = Tc - Ta 

P P 

0 = Thermal resistance 

J = Junction 

P = Operational power of device (dissipated) 

Ta = Ambient temperature in degree Celsius 
Tj = Temperature of the junction 
Tc = Temperature of case/package 
0ca = Case to Ambient, thermal resistance — usually a 
measure of the heat dissipation due to natural or 
forced convection, radiation and mounting 
techniques. 

Ojc = Junction to Case, thermal resistance — usually 
measured with reference to the temperature at a 
specific point on the package (case) surface. 
(Dependent on the package material properties 
and package geometry.) 

0JA = Junction to Ambient, thermal resistance — usually 
measured with respect to the temperature of a 
specified volume of still air. (Dependent on 0jc + 
0ja which includes the influence of area and 
environmental condition.) 


Ref. MIL-STD-883C, Method 1012.1 

JEDEC ENG. Bulletin No. 20, January 1975 

1986 Semi. Std., Vol. 4, Test Methods G30-86, G32-86. 
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Thermal Resistance of Ceramic DIP Packages 
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PACKAGE DIAGRAM OUTLINE INDEX 

PAGE 

MONOLITHIC PACKAGE DIAGRAM OUTLINES 


PKG. DESCRIPTION 

P16-1 16-Pin Plastic DIP (300 mil) 35 

P18-1 18-Pin Plastic DIP (300 mil) 36 

P20-1 20-Pin Plastic DIP (300 mil) 36 

P22-1 22-Pin Plastic DIP (300 mil) 35 

P24-1 24-Pin Plastic DIP (300 mil) 36 

P24-2 24-Pin Plastic DIP (600 mil) 37 

P28-1 28-Pin Plastic DIP (600 mil) 37 

P28-2 28-Pin Plastic DIP (300 mil) 35 

P32-1 32-Pin Plastic DIP (600 mil) 37 

P32-2 32-Pin Plastic DIP (300 mil) 35 

P40-1 40-Pin Plastic DIP (600 mil) 37 

P48-1 48-Pin Plastic DIP (600 mil) 37 

P64-1 64-Pin Plastic DIP (900 mil) 37 

D1 6-1 1 6-Pin CERDIP (300 mil) 1 

D18-1 18-Pin CERDIP (300 mil) 1 

D20-1 20-Pin CERDIP (300 mil) 1 

D22-1 22-Pin CERDIP (300 mil) 1 

D24-1 24-Pin CERDIP (300 mil) ; 1 

D24-2 24-Pin CERDIP (600 mil) 2 

D28-1 28-Pin CERDIP (600 mil) 2 

D28-2 28-Pin CERDIP (wide body) 2 

D28-3 28-Pin CERDIP (300 mil) 1 

D32-1 32-Pin CERDIP (wide body) 2 

D40-1 40-Pin CERDIP (600 mil) 2 

D40-2 40-Pin CERDIP (wide body) 2 

C20-1 20-Pin Sidebraze DIP (300 mil) 3 

C22-1 22-Pin Sidebraze DIP (300 mil) 3 

C24-1 24-Pin Sidebraze DIP (300 mil) 3 

C24-2 24-Pin Sidebraze DIP (600 mil) 5 

C28-1 28-Pin Sidebraze DIP (300 mil) 3 

C28-2 28-Pin Sidebraze DIP (400 mil) 4 

C28-3 28-Pin Sidebraze DIP (600 mil) 5 

C32-1 32-Pin Sidebraze DIP (600 mil) 5 

C32-2 32-Pin Sidebraze DIP (400 mil) 4 

C32-3 32-Pin Sidebraze DIP (300 mil) 3 

C40-1 40-Pin Sidebraze DIP (600 mil) 5 

C48-1 48-Pin Sidebraze DIP (400 mil) 4 

C48-2 48-Pin Sidebraze DIP (600 mil) 5 

C64-1 64-Pin Sidebraze DIP (900 mil) : 6 

C64-2 64-Pin Topbraze DIP (900 mil) 7 

C68-1 68-Pin Sidebraze DIP (600 mil) 5 

PG68-2 68-Lead Plastic Pin Grid Array (cavity up) 49 

PG84-2 84-Lead Plastic Pin Grid Array (cavity up) 49 

PG208-2 208-Lead Plastic Pin Grid Array (cavity up) 49 

G68-1 68-Lead Pin Grid Array (cavity up) 22 

G68-2 68-Lead Pin Grid Array (cavity down) 28 

G84-1 84-Lead Pin Grid Array (cavity up — 1 2 x 1 2 grid) 23 

G84-2 84-Lead Pin Grid Array (cavity down) 29 

G84-3 84-Lead Pin Grip Array (cavity up — 1 1 x 1 1 grid) 24 
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MONOLITHIC PACKAGE DIAGRAM OUTLINES (Continued) 

PKG. DESCRIPTION 

G84-4 84-Lead Pin Grid Array (cavity down — R301 OA) 30 

G1 08-1 108-Lead Pin Grid Array (cavity up) 25 

G1 44-1 144-Lead Pin Grid Array (cavity down) 31 

G 144-2 144-Lead Pin Grid Array (cavity up) 26 

G208-1 208-Lead Pin Grid Array (cavity up — R3001) 27 

G208-2 208-Lead Pin Grid Array (cavidy down) 34 

SOI 6-1 1 6-Pin Small Outline 1C (gull wing) 38 

SOI 6-2 1 6-Pin Small Outline 1C (J-bend) 41 

SOI 6-5 1 6-Pin Small Outline 1C (El AJ — .031 5 pitch) 40 

SOI 6-6 1 6-Pin Small Outline 1C (EIAJ — .050 pitch) 40 

SOI 8-1 18-Pin Small Outline 1C (gull wing) 38 

SOI 8-6 18-Pin Small Outline 1C (EIAJ — .050 pitch) 40 

SO20-1 20-Pin Small Outline 1C (J-bend) 41 

SO20-2 20-Pin Small Outline 1C (gull wing) 38 

SO20-5 20-Pin Small Outline 1C (EIAJ — .0315 pitch) 40 

SO20-6 20-Pin Small Outline 1C (EIAJ — .050 pitch) 40 

S024-2 24-Pin Small Outline 1C (gull wing) 38 

S024-3 24-Pin Small Outline 1C (gull wing) 38 

S024-4 24-Pin Small Outline 1C (J-bend) 41 

S024-5 24-Pin Small Outline 1C (EIAJ — .031 5 pitch) 40 

S024-6 24-Pin Small Outline 1C (EIAJ — .050 pitch) . 40 

S028-2 28-Pin Small Outline 1C (gull wing) 39 

S028-3 28-Pin Small Outline 1C (gull wing) 39 

S028-4 28-Pin Small Outline 1C (J-bend — 350 mil) 42 

S028-5 28-Pin Small Outline 1C (J-bend — 300 mil) 42 

S028-6 28-Pin Small Outline 1C (EIAJ — .050 pitch) 40 

S032-2 32-Pin Small Outline 1C (J-bend) 42 

S048-1 48-Pin Small Outline 1C (SSOP — gull wing) 43 

S056-1 56-Pin Small Outline 1C (SSOP — gull wing) 43 

J18-1 18-Pin Plastic Leaded Chip Carrier (rectangular) 48 

J20-1 20-Pin Plastic Leaded Chip Carrier (square) 47 

J28-1 28-Pin Plastic Leaded Chip Carrier (square) 47 

J32-1 32-Pin Plastic Leaded Chip Carrier (rectangular) 48 

J44-1 44-Pin Plastic Leaded Chip Carrier (square) 47 

J52-1 52-Pin Plastic Leaded Chip Carrier (square) 47 

J68-1 68-Pin Plastic Leaded Chip Carrier (square) 47 

J84-1 84-Pin Plastic Leaded Chip Carrier (square) 47 

L20-1 20-Pin Leadless Chip Carrier (rectangular) 21 

L20-2 20-Pin Leadless Chip Carrier (square) 19 

L22-1 22-Pin Leadless Chip Carrier (rectangular) 21 

L24-1 24-Pin Leadless Chip Carrier (rectangular) 21 

L28-1 28-Pin Leadless Chip Carrier (square) 19 

L28-2 28-Pin Leadless Chip Carrier (rectangular) 21 

L32-1 32-Pin Leadless Chip Carrier (rectangular) 21 

L44-1 44-Pin Leadless Chip Carrier (square) 19 

L48-1 48-Pin Leadless Chip Carrier (square) 19 

L52-1 52-Pin Leadless Chip Carrier (square) 20 

L52-2 52-Pin Leadless Chip Carrier (square) 20 

L68-1 68-Pin Leadless Chip Carrier (square) 20 

L68-2 68-Pin Leadless Chip Carrier (square) 20 


.2 


2 



PAGE 


MONOLITHIC PACKAGE DIAGRAM OUTLINES (Continued) 


PKG. DESCRIPTION 

El 6-1 16-Lead CERPACK 

E20-1 20-Lead CERPACK 

E24-1 24-Lead CERPACK 

E28-1 28-Lead CERPACK 

E28-2 28-Lead CERPACK 


CQ68-1 68-Lead CERQUAD (straight leads) 

CQ84-1 84-Lead CERQUAD (J-bend) 


F20-1 

F20-2 

F24-1 

F28-1 

F28-2 

F48-1 

F64-1 

F68-1 

F68-2 

F84-1 

F84-2 

FI 72-1 

FI 72-2 


20-Lead Flatpack 

20-Lead Flatpack (.295 body) 

24-Lead Flatpack 

28-Lead Flatpack 

28-Lead Flatpack 

48-Lead Quad Flatpack 

64-Lead Quad Flatpack 

68-Lead Quad Flatpack 

68-Lead Quad Flatpack (straight leads) 

84-Lead Quad Flatpack (cavity down) 

84-Lead Quad Flatpack (cavity up) 

172-Lead Quad Flatpack (cavity up — R3001) 

172-Lead Quad Flatpack (cavity down — R3000A) 


16 

16 

16 

16 

16 

17 

18 

8 

8 

8 

8 

8 

9 

9 

10 
11 
12 

13 

14 

15 



PQ80-2 80-Lead Plastic Quad Flatpack (IEAJ) 45 

PQ1 00-1 1 00-Lead Plastic Quad Flatpack (JEDEC) 44 

PQ100-2 100-Lead Plastic Quad Flatpack (EIAJ) 45 

PQ120-2 120-Lead Plastic Quad Flatpack (EIAJ) 45 

PQ128-2 128-Lead Plastic Quad Flatpack (EIAJ) 45 

PQ132-1 132-Lead Plastic Quad Flatpack (JEDEC) 44 

PQ144-2 144-Lead Plastic Quad Flatpack (EIAJ) 46 

PQ160-2 160-Lead Plastic Quad Flatpack (EIAJ) 46 

PQ184-2 184-Lead Plastic Quad Flatpack (EIAJ) 46 

PQ208-2 208-Lead Plastic Quad Flatpack (EIAJ) 46 


.2 
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DUAL IN-LINE PACKAGES 



i II- 


bl 


_L 

SI — 

f— 

A ifir 

SEATING PLANE — i — *— 

1 L 

MMfli 

1 1 

T~L| 

— b S — 




NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. THE MINIMUM LIMIT FOR DIMENSION bl MAY BE .023 FOR CORNER LEADS. 


16-28 LEAD CERDIP (300 MIL) 


DWG # 

D16-1 

D 1 8 — 1 

D20-1 

D22-1 

D24-1 

D28-3 

# OF LDS (N) 

16 

18 

20 

22 

24 

28 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.140 

.200 

.140 

.200 

.140 

.200 

.140 

.200 

.140 

.200 

.140 

.200 

b 

.015 

.021 

.015 

.021 

.015 

.021 

.015 

.021 

.015 

.021 

.015 

.021 

bl 

.038 

.060 

.038 

.060 

.038 

.060 

.045 

.060 

.045 

.065 

.045 

.065 

C 

.009 

.012 

.009 

.012 

.009 

.012 

.009 

.012 

.009 

.014 

.009 

.014 

D 

.750 

.830 


.930 

.935 

innm 

1.050 

■■IKW 

o 

CN 

1.280 

1.440 

-£» 

CD 

O 

E 

.285 

.310 

.285 

.310 

.285 

.310 

.285 

.310 

.285 

.310 

.285 

.310 

El 

.290 

.320 

.290 

.320 

.290 

.320 

.300 

.320 

.300 

.320 

.300 

.320 

e 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC | 

L 

.125 

.175 

.125 

.175 

.125 

.175 

.125 

.175 

.125 

.175 

.125 

.175 

LI 

.150 

- 

.150 

- 

.150 

- 

.150 

- 

.150 

- 

.150 

- 

Q 

.015 

.055 

.015 

.055 

.015 

.060 

.015 

.060 

.015 

.060 

.015 

.060 

S 

.020 

.080 

.020 

.080 

.020 

.080 

.020 

.080 

.030 

.080 

.030 

.080 

SI 

.005 

- 

.005 

- 

.005 

- 

.005 

- 

.005 

- 

.005 

- 

a 

O' 

15' 

O' 

15' 

0' 

15' 

0' 

15' 

0' 

15' 

0' 

15' 



































































































































































































DUAL IN-LINE PACKAGES (Continued) 


24-40 LEAD CERDIP (600 MIL) 


DWG # 


OF LDS 


m 


D24-2 


24 


D28-1 


28 


D40-1 


40 


SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.090 

.190 

.090 

.200 

.160 

.220 

b 

.014 

.023 

.014 

.023 

.014 

.023 

bl 

.038 

.060 

.038 

.065 

.038 

.065 

C 

.008 

.012 

.008 

.014 

.008 

.014 

D 

1.230 

1.290 

1.440 

1.490 

2.020 

2.070 

E 

.500 

.610 

.510 

.545 

.510 

.545 

El 

.590 

.620 

.590 

.620 

.590 

.620 


.100 BSC 


L 

.125 

LI 

.150 

Q 

.015 

S 

.030 

SI 

.005 

a 

0* 



.100 BSC 


.125 .200 


.150 


.020 .060 


.030 .080 


.005 


.100 BSC 


.125 .200 


.150 


.020 .060 


.030 | .080 


.005 


O’ 15* 



28-40 LEAD CERDIP (WIDE BODY) 


DWG # 


OF LDS 


w\ 


D28-2 


28 


D32-1 


32 


D40-2 


40 


SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.090 

.200 

.120 

.210 

.160 

.220 

b 

.014 

.023 

.014 

.02 3 

.014 

.023 

bl 

.038 

.065 

.038 

.065 

.038 

.065 

C 

.008 

.014 

.008 

.014 

.008 

.014 

D 

1.440 

1.490 

1.625 

1.675 

2.020 

2.070 

E 

.570 

.600 

.570 

.600 

.570 

.600 

El 

.590 

.620 

.590 

.620 

.590 

.620 


L 

.125 

LI 

.150 

Q 

.020 

S 

.030 

SI 

.005 

a 

0* 


.100 BSC 


.100 BSC 


.125 .200 


.150 


.020 .060 


.030 .080 


.005 


.100 BSC 


.125 .200 


.150 


.020 .060 


.030 | .080 


.005 


0* I 15* 


.3 


2 



















































































































































































































PACKAGE DIAGRAM OUTLINES 


DUAL IN-LINE PACKAGES (Continued) 


20-32 LEAD SIDE BRAZE (300 MIL) 



SEATING PLANE 


El 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 


DWG # 

C20-1 

C22-1 

C24-1 

C28-1 

C32-3 

# OF LDS (N) 

20 

22 

24 

28 

32 

SYMBOL 

MIN 

MAX 

MW 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.090 

.200 

.100 

.200 

.090 

.200 

.090 

.200 

.090 

.200 

b 

.014 

.023 

.014 

.023 

.015 

.023 

.014 

.023 

.01 P 

.023 

bl 

.040 

.060 

.040 

.060 

.040 

.060 

.040 

.060 

.040 

.060 

c 

.008 

.015 

.008 

.015 

.008 

.015 

.008 

.015 

.008 

.014 

D 

.970 

1.060 

1.040 

1.120 

1.180 

1.230 

1.380 

1.420 

1.580 

1.640 

E 

.260 

.310 

.260 

.310 

.220 

.310 

.220 

.310 

.280 

.310 

El 

.290 

.320 

.290 

.320 

.290 

.320 

.290 

.320 

.290, 

.320 

e 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC 

L 

.125 

.200 

.125 

.200 

.125 

.200 

.125 

.200 

.100 

.175 

LI 

.150 

- 

.150 

— 

.150 

— 

.150 

— 

.150 

— 

Q 

.015 

.060 

.015 

.060 

.015 

.060 

.015 

.060 

.030 

.060 

S 

.030 

.065 

.030 

.065 

.030 

.065 

.030 

.065 

.030 

.065 

SI 

.005 

- 

.005 

- 

.005 

- 

.005 

— 

.005 

- 

S2 

.005 

- 

.005 

- 

.005 

- 

.005 

- 

.005 

- 


4.3 


3 































































































































































PACKAGE DIAGRAM OUTLINES 


DUAL IN-LINE PACKAGES (Continued) 


28-48 LEAD SIDE BRAZE (400 MIL) 




NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 


DWG # 

C28-2 

C32-2 

C48-1 

# OF LDS (N) 

28 

32 

48 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.090 

.200 

.090 

.200 

.085 

.190 

b 

.014 

.023 

.014 

.023 

.014 

.023 

bl 

.040 

.060 

.040 

.060 

.040 

.060 

C 

.008 

.014 

.008 

.014 

.008 

.014 

D 

1.380 

1.420 

1.580 

1.640 

1.690 

1.730 

E 

.380 

.420 

.380 

.410 

.380 

.410 

El 

.390 

.420 

.390 

.420 

.390 

.420 

e 

.100 BSC 

.100 BSC 

.070 BSC 1 

L 

.100 

.175 

.100 

.175 

.125 

.175 

LI 

.150 

- 

.150 

— 

.150 

- 

Q 

.030 

.060 

.030 

.060 

.020 

.070 

S 

.030 

.065 

.030 

.065 

.030 

.065 

SI 

.005 

- 

.005 

— 

.005 


S2 

.005 

- 

.005 

- 

.005 



4.3 


4 







PACKAGE DIAGRAM OUTLINES 


DUAL IN-LINE PACKAGES (Continued) 


24-68 LEAD SIDE BRAZE (600 MIL) 



LJLJLJ 

68 LEAD OPTION 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 


DWG # 

C24-2 

C28-3 

C32-1 

C40-1 

C48-2 

C68-1 

# OF LDS (N) 

24 

28 

32 

40 

48 

68 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 


.190 

EHEfl 

.190 

.100 

.190 

Kilim 

.190 

.100 

.190 

RES 1 

.190 

b 

.015 

.023 

.015 

.022 

.015 

.023 

.015 

.023 

.015 

.023 

.015 

.023 

bl 

.040 

.060 

.038 

.060 

.040 

.060 

.038 

.060 

.040 

.060 

.040 

.060 

c 

K>I«UI 

.012 


.012 

.008 

.014 


.012 


.012 


.012 

D 

1.180 

1.220 

1.380 

1.430 

1.580 

1.640 

■IKIUtl 


2.370 

2.430 

EBH3 

2.440 

E 

.575 

.610 

.580 

.610 

.580 

.610 

.580 

.610 

.550 

.610 

.580 

.610 

El 

.595 

.620 

.595 

.620 

.590 

.620 

.595 

.620 

.595 

.620 

.590 

.620 

e 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC 

.100 BSC 

.070 BSC 1 

L 

.125 

.175 

.125 

.175 

.100 

.175 

.125 

.175 

.125 

.175 

.125 

.175 

LI 

.150 

- 

.150 

— 

.150 

- 

.150 

— 

.150 

— 

.150 

— 

Q 

.020 

.060 

.020 

.065 

.020 

.060 

.020 

.060 

.020 

.060 

.020 

.070 

S 

.030 

.065 

.030 

.065 

.030 

.065 

.030 

.065 

.030 

.065 

.030 

.065 

SI 

.005 

- 

.005 

— 

.005 

— 

.005 

— 

.005 

— 

.005 

— 

S2 

IBMkm 

- 

■>HB1 

- 

■PICT 

- 


- 


- 

HIlM 

- 


4.3 
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PACKAGE DIAGRAM OUTLINES 

















































PACKAGE DIAGRAM OUTLINES 



DUAL IN-LINE PACKAGES (Continued) 





















































PACKAGE DIAGRAM OUTLINES 


FLATPACKS 


20-28 LEAD FLATPACK 





I 


T 

E2 

_L 


r 


E3 


E3 




C 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 




F20-1 

F20-2 

F24-1 

F28-1 

F28-2 

# OF LDS (N) 

20 

»¥il 

24 

28 

28 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.045 

.092 

.045 

.092 

.045 

.090 

.045 

.090 

.045 

.115 

b 

.015 

.019 

.015 

.019 

.015 

.019 

.015 

.019 

.015 

.019 

C 

.003 

.006 

.003 

.006 

.003 

.006 

.004 

.007 

.003 

.007 

D 

- 

.540 

- 

.540 

- 

.640 

.710 

.740 

.710 

.740 

E 

.340 

.360 

.245 

.303 

.360 

.420 

.480 

.520 

.480 

.520 

E2 

.130 

- 

.130 

- 

.180 

- 

.180 

- 

.180 

- 

E3 

.030 

- 

.030 

- 

.030 

- 

.040 

- 

.040 

- 

e 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC | 

K 

.006 

.015 

.008 

.015 

- 

■ - 

- 

- 

- 

- 

L 

.250 

.370 

.250 

.370 

.250 

.370 

.250 

.370 

.250 

.370 

Q 

.010 

.040 

.010 

.040 

.010 

.040 

.010 

.045 

.026 

.045 

S 

- 

.045 

- 

.045 

- 

.045 

- 

.045 

- 

.045 

SI 

.000 

- 

.005 

- 

.005 

- 

.005 

- 

.005 

- 


4.3 
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FLATPACKS (Continued) 




























































FLATPACKS (Continued) 


68 LEAD QUAD FLATPACK 


PIN 1 INDEX 



E2 El E 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 


DWG 


OF LDS 


SYMBOL 


A1 


b 


C 


D/E 


D1/E1 


D2 


e 


L 


ND/NE 


m 


F6* 

3—1 

68 

MIN 

MAX 

.080 

.145 

.070 

.090 

.014 

.021 

.008 

.012 

1.640 

1.870 

.926 

.970 

.800 

BSC 

.050 

BSC 

.350 | 

1 .450 

17 


1.3 


10 










































PACKAGE DIAGRAM OUTLINES 


FLATPACKS (Continued) 

68 LEAD QUAD FLATPACK (STRAIGHT LEADS) 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 








































PACKAGE DIAGRAM OUTLINES 




FLATPACKS (Continued) 



84 LEAD QUAD FLATPACK (CAVITY DOWN) 



DWG # 

F84-1 

# OF LDS (N) 

84 

SYMBOL 

MIN 

MAX 

A 

- 

.140 

A1 

- 

.105 

b 

.014 

.020 

C 

.007 

.013 

D/E 

1.485 

1.615 

D1/E1 

1.130 

1.170 

D2/E2 

1.000 BSC 

D3/E3 

.500 BSC 

e 

.050 BSC I 

L 

.350 | .450 

ND/NE 

21 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, 
UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING 
BETWEEN CENTERS. 

3. CROSS HATCHED AREA INDICATES 
INTEGRAL METALLIC HEAT SINK. 



U — .012 MAX 
AT BRAZE PADS 

DETAIL A 


4.3 


12 






FLATPACKS (Continued) 


84 LEAD QUAD FLATPACK (CAVITY UP) 


£! 



PIN 1 INDEX 


iMMi 




E2 El E 


m 


D 


OF LDS 


SYMBOL 


C 


D 


D1/E1 


D2 


D3/E3 


e 


L 


mi\ 


F84-2 


84 


MIN 

MAX 

- 

.140 

- 

.105 

.014 

.020 

.007 

.013 

1.485 

1.615 

1.130 

1.170 

1.000 

BSC 

.500 

BSC 

.050 

BSC 

.350 

.450 

1 2’ 1 



-.018 MAX 


DETAIL 


.012 MAX 
AT BRAZE PADS 

A” 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. CROSS HATCHED AREA INDICATES INTEGRAL METALLIC 
HEAT SINK. 


4.3 


13 







































PACKAGE DIAGRAM OUTLINES 


FLATPACKS (Continued) 


172 LEAD QUAD FLATPACK (CAVITY UP - R3001) 



DWG # 

FI 72-1 

# OF LDS (N) 

172 

SYMBOL 

MIN 

MAX 

A 

- 

.130 

A1 

- 

.105 

b 

.006 

.010 

C 

.004 

.008 

d7e 

1.580 

1.620 

D1/E1 

1.135 

1.165 

D2/E2 

1.050 BSC 

D3/E3 

.525 BSC 

e 

.025 BSC 

L 

.220 | .230 

ND/NE 

43 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, 
UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING 
BETWEEN CENTERS. 


h*7 c 

.008±.006 

AT BRAZE PADS 
DETAIL A 



4.3 


14 





PACKAGE DIAGRAM OUTLINES 


FLATPACKS (Continued) 

172 LEAD QUAD FLATPACK (CAVITY DOWN - R3000A) 



DWG # 

FI 72-2 

# OF LDS (N) 

172 

SYMBOL 

MIN 

MAX 

A 

- 

.130 

A1 

- 

.105 

b 

.006 

.010 

C 

.004 

.008 

d7e 1 

1.580 

1.620 


1.135 

1.165 

D2/E2 

1.050 BSC 

D3/E3 

.525 BSC 

e 

.025 BSC 

L 

.220 | .230 

ND/NE 

43 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, 
UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING 
BETWEEN CENTERS. 

3. CROSS HATCHED AREA INDICATES 
METALLIC HEAT SINK. 



_L 

.018 MAX 



h*— ,008±.006 
AT BRAZE PADS 


DETAIL A 


4.3 
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PACKAGE DIAGRAM OUTLINES 


CERPACKS 


16-28 LEAD CERPACK 




NOTES: 

1. ALL DIMENSION ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 




E16-1 

E20-1 

E24-1 

E28-1 

E28-2 

# OF LDS (N) 

16 

20 

24 

28 

28 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.055 

.085 

.045 

.092 

.045 

.090 

.045 

.115 

.045 

.090 

b 

.015 

.019 

.015 

.019 

.015 

.019 

.015 

.019 

.015 

.019 

C 

.0045 

.006 

.0045 

.006 

.0045 

.006 

.0045 

.009 

.0045 

.006 

D 

.370 

.430 

- 

.540 

- 

.640 

- 

.740 

- 

.740 

E 

.245 

.285 

.245 

.300 

.300 

.420 

.460 

.520 

.340 

.380 

El 

- 

.305 

- 

.305 

- 

.440 

- 

.550 

- 

.400 

e 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC ] 

K 

.008 

.015 

.008 

.015 

.008 

.015 
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PACKAGE DIAGRAM OUTLINES 


CERQUADS 


68 LEAD CERQUAD (STRAIGHT LEADS) 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 


DWG # 

CQ68-1 

# OF LDS (N) 

68 

SYMBOL 

MIN 

MAX 

A 

.115 

.165 

b 

.008 

.013 

C 
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.008 

D/E 

.860 

1.100 

D1/E1 
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.500 

D3/E3 

.400 REF 
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.025 BSC 
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.200 | .300 


17 


1.3 


17 



































CERQUADS (Continued) 















PACKAGE DIAGRAM OUTLINES 


LEADLESS CHIP CARRIERS 




NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 



20-48 LEAD LCC (SQUARE) 


DWG # 
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L48-1 

# OF LDS (N) 
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28 
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.100 
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.054 
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.054 
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B1 

.022 

.028 
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.028 

.022 
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.017 

.023 

B2 

.072 REF 

.072 REF 

.072 REF 

.072 REF | 

B3 
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.006 

.022 

.006 

.022 

.006 

.022 

D/E 

.342 

.358 

.442 

.460 

.640 

.660 

.554 

.572 

D1/E1 

.200 BSC 

.300 BSC 

.500 BSC 

.440 BSC 

D2/E2 

.100 BSC 

.150 BSC 

.250 BSC 

.220 BSC ; 

D3/E3 

- | .358 

- | .460 

- | .560 

.500 | .535 

e 

.050 BSC 

.050 BSC 

.050 BSC 

.040 BSC 

el 

.015 | - 

.015 | ~ 

.015 | - 

.015 | - 
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.040 REF 

.040 REF 

.040 REF 

.012 RADIUS 
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.020 REF 

.020 REF 

.020 REF 

.020 REF 
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.055 

.045 

.055 

.045 

.055 
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.047 
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.045 

.055 

.045 

.055 

.045 

.055 
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.047 

L2 
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.093 

.077 
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.077 

.093 

.077 

.093 

L3 

.003 

.015 

.003 

.015 

.003 

.015 

.003 

.015 
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5 

7 

11 

12 1 
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LEADLESS CHIP CARRIERS (Continued) 


52-68 LEAD LCC (SQUARE) 


DWG # L52-1 L52-2 L68-2 L68-1 

OF LDS (N) 52 52 68 68 

SYMBOL MIN MAX~ MIN MAX ~~mTnT MAX "~MIN MAX 
A .061 .087 .082 .120 .082 .120 .065 .120 

A1 .051 .077 .072 .088 .072 .088 .055 .075 

B1 .022 .028 .022 .028 .022 .028 .008 .014 

B2 .072 REF .072 REF .072 REF .072 REF 

B3 .006 .022 .006 .022 .006 .022 .006 .022 

D/E .739 .761 .739 .761 .938 .962 .554 .566 

D1/E1 .600 BSC .600 BSC .800 BSC .400 BSC 

D2/E2 .300 BSC .300 BSC .400 BSC .200 BSC 

D3/E3 = ["7(361 - | .661 - | .862 - |7535 

e .050 BSC .050 BSC .050 BSC .025 BSC 

el .015 | - “015 | - .015 | - .015 | - 

h .040 REF .040 REF .040 REF .040 REF 

J .020 REF .020 REF .020 REF .020 REF 
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.055 

.045 

.055 

.045 

.055 

.045 

.055 

LI 

.045 

.055 

.045 
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.077 

.093 

.077 
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.015 

.003 

.015 

.003 

.015 

.003 

.015 


ND/NE | 13 | 13 | 17 1 17 



















































































































































PACKAGE DIAGRAM OUTLINES 


LEADLESS CHIP CARRIERS (Continued) 



DWG # 
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- 
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- 
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.435 

.480 

.496 
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.560 

.540 

.560 
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.400 BSC 
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E2 
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E3 

- | .410 

- | .480 

= | .408 
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— - | .558 
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.050 BSC 
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L2 
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.095 
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.077 

.093 
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.015 

.003 

.015 

.003 

.015 

.003 
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.003 
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ND 
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5 

7 

NE 

6 

7 

7 

9 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS 


68 PIN PGA (CAVITY UP) 




DWG # 

G68-1 

# OF PINS (N) 

68 

SYMBOL 

MIN 

MAX 

A 

.070 

.145 

0B 

.016 

.020 

0B1 

- 

.080 

0B2 

.040 

.060 

D/E 

1.140 

1.180 

D1/E1 

! 1.000 BSC 

e 

.100 BSC 

L 

.120 | .140 

M 

11 

Q 

.040 

.060 | 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


84 PIN PGA (CAVITY UP - 12 X 12 GRID) 


BOTTOM VIEW ^0B1 
2345 6,789 10 A\ 12 


TOP VIEW 
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D/E 
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D1/E1 

1.100 BSC 

e 

.100 BSC 
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.120 | .140 

M 

12 

Q 

.025 

.060 | 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL "M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


84 PIN PGA (CAVITY UP - 11 X 11 GRID) 


BOTTOM VIEW ,- am 
2 3 4 5 6 7 8 /9 10 11 


TOP VIEW 
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0B 
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0B1 

- 
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0B2 

.040 
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D/E 

1.080 

1.120 

D1/E1 

1.000 BSC 

e 

.100 BSC 

L 

,.120 | .140 

M 

11 

Q 

.040 

.060 | 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL "M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OULTINES 


PIN GRID ARRAYS (Continued) 


108 PIN PGA (CAVITY UP) 
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1 23456,789 10/11 12 


©0©©©© 

@©©@@© 

©©©©©© 

@©@ 

@®@ 

©©© 

©©©©©© 

©©©©©© 

©@@®@© 

©©© 

©@© 

©©© 

S 





@@@ 

@@@ 

©@© 

@@©@@@ 

@@©©@@ 

©©©©©© 

@@© 

®@© 

@@© 

@@@©@© 

©©@@@@ 

©©©©©($> 




2L 

ni 

PIN 1 ID— ^ 

D 


SEATING PLANE 



DWG # 

G108-1 

# OF PINS (N) 
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A 
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0B 
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.020 

■0B1 

- 
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0B2 
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.060 

D/E 
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1.212 

D1/E1 

1.100 BSC 

e 

.100 BSC 

L 

.120 | .140 

M 

12 

Q 

.040 .060 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL "M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL "N" REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


144 PIN PGA (CAVITY UP - R3001) 

BOTTOM VIEW ^0B1 
123456789 10/11 12 13 14 15 
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DWG # 
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D/E 

1.559 
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D1/E1 

1.400 BSC 

e 

.100 BSC 

L 

.120 
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M 

15 

Q 
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NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M" REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 

6. EXTRA PIN (D-4) ELECTRICALLY CONNECTED TO D-3. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


208 PIN PGA (CAVITY UP) 
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- 
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0B2 
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.060 

D/E 
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1.780 

D1/E1 

1.600 BSC 

e 

.100 BSC 

L 

.125 | .140 

M 

17 

Q 

.040 | .060 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL "1ST REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 

68 PIN PGA (CAVITY DOWN) 
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M 

11 

Q1 
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.060 | 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL "M"- REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL "N" REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 
84 PIN PGA (CAVITY DOWN) 


BOTTOM VIEW r-0 Bl 
1 2345 6,7 89 10711 12 


TOP VIEW 
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NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL "N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 

84 PIN PGA (CAVITY DOWN - R3010A) 


BOTTO M VIEW r- 0B1 
1 23456,789 10/11 12 


TOP VIEW 
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NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 

6. CROSS HATCHED AREA INDICATES INTEGRALL METALLIC 
HEAT SINK. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


144 PIN PGA (CAVITY DOW) 

BOTTOM VIEW ^0B1 
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NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL "N" REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 
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NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 

6. EXTRA PIN (D— 4) ELECTRICALLY CONNECTED TO D-3. 

7. CROSS HATCHED AREA INDICATES INTEGRAL METALLIC 
HEAT SINK. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


TOP 

VIEW 







■ 

M 




Kmj 

M 

m 

/ 


m 


175 PIN PGA (CAVITY DOWN - R3000A) 

BOTTOM VIEW ^0B1 
123456789 10711 12 13 14 15 
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)©©©©©©© 

)©©©©©©© 

)©©©©©©© 
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D1 

D 


El E 


NOTE 6 


SEATING PLANE 



DWG # 

G175-1 

# OF PINS (N) 

175 

SYMBOL 

MIN 

MAX 

A 

.082 

.130 

0B 

.016 

.020 

0B1 

.060 

. .080 

0B2 

.040 

.060 

D/E 

1.559 

1.590 

D1/E1 

1.400 BSC 

e 

.100 BSC 

L 

.120 | .140 

M 

15 

Q1 

.025 .060 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 

6. CROSS HATCHEDAREA INDICATES INTEGRAL METALLIC 
HEAT SINK. 
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PACKAGE DIAGRAM OUTLINES 


PIN GRID ARRAYS (Continued) 


208 PIN PGA (CAVITY DOWN) 



DWG # 

G208-2 

# OF PINS (N) 

208 ! 

SYMBOL 

MIN 

MAX 

A 

.070 

.145 

0B 

.016 

.020 

0B1 

- 

,080 

0B2 

.040 

.060 

D/E 

1.732 

1.780 

D1/E1 

1.600 BSC 

e 

.100 BSC 

L 

.125 | .140 

M 

17 

Q1 

.025 

.060 | 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS 

5. CHAMFERED CORNERS ARE IDT’S OPTION. 
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PACKAGE DIAGRAM OUTLINES 


PLASTIC DUAL IN-LINE PACKAGES 
16-32 LEAD PLASTIC DIP (300 MIL) 




NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. D k El DO NOT INCLUDE MOLD FLASH OR PROTRUSIONS. 


DWG # 

P16-1 

P22-1 

P28-2 

P32-2 

# OF LDS (N) 

16 

22 

28 

32 

SYMBOLS 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.140 

.165 

.145 

.165 

.145 

.180 

.145 

.180 

A1 

.015 

.035 

.015 

.035 

.015 

.030 

.015 

.030 

b 

.015 

.022 

.015 

.022 

.015 

.022 

.016 

.022 

bl 

.050 

.070 

.050 

.065 

.045 

.065 

.045 

.060 

C 

.008 

.012 

.008 

.012 

.008 

.015 

.008 

.015 

D 

.745 

.760 

1.050 

1.060 

1.345 

1.375 

1.545 

1.585 

E 

.300 

.325 

.300 

.320 

.300 

.325 

.300 

.325 

El 

.247 

.260 

.240 

.270 

.270 

.295 

.275 

.295 

e 

.090 

.110 

.090 

.110 

.090 

.110 

.090 

.110 

eA 

.310 

.370 

.310 

.370 

.310 

.400 

.310 

.400 

L 

.120 

.150 

.120 

.150 

.120 

.150 

.120 

.150 

a 

0* 

15’ 

O' 

15‘ 

0' 

15’ 

O’ 

15’ 

S 

.015 

.035 

.020 

.040 

.020 

.042 

.020 

.060 

Q1 

.050 

.070 

.055 

.075 

.055 

.065 

.055 

.065 
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PLASTIC DUAL IN-LINE PACKAGES (Continued) 












































































PACKAGE DIAGRAM OUTLINES 



24-48 LEAD PLASTIC DIP (600 MIL) 


P32-1 



1 24 1 

1 28 1 

1 32 1 

L 40 

i 48 

SYMBOLS 

! MIN 

1 MAX 1 

1 MIN 1 

MAX 1 

1 MIN 1 

MAX 1 

! MIN 1 

MAX 1 

1 MIN 

1 MAX 




IliHI IWtKW 

mwi*m 

■WHIM 

A1 

1 .015 1 

■ftM 

EM 

.035 1 

lilMI 

.050 

1 .015 1 


1 .015 ! 


b 

EM 

.020 

.015 

EM 

.016 

.020 

HIM 

1»W»M 


1 .020 

bl 


.065 

.050 

1 .065 1 

.045 

HiEMI 

1 .050 1 

.065 

.050 


C 

■hhmEm ■PitBliiiiWi 

.008 

iiiw WbUfcto 

.012 

.008 

.012 

D 

1.240 

1 1.260 1 

1 1.420 1 

1.460 1 

1.645 

1 1.655 1 

2.050 

2.070 

2.420 

2.450 

E 

.600 


.600 

.620 

.600 


El 

.530 

.550 


.550 


.550 


e 

.090 

.110 


.110 

.090 

.110 

.090 

.110 

.090 

.110 

eA 

.610 

.670 

.610 

.670 

.610 

.670 

.610 

.670 

.610 

.670 

L 

.120 

.150 

.120 


.125 

.135 

■BM 

.150 

.120 

.150 

a 

O' 

15* 

0* 

1 15- I 

0* 

15* 

0' 

15* 

O' 

15* 

S 

.060 


.055 


.070 

.080 

.070 


.075 
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PACKAGE DIAGRAM OUTLINES 


SMALL OUTLINE IC 


b“ s 



p| N i-4) yyyyyyyt 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. D & E DO NOT INCLUDE MOLD FLASH OR PROTRUSIONS 
AND TO BE MEASURED FROM THE BOTTOM OF PKG. 

4. FORMED LEADS SHALL BE PLANAR WITH RESPECT TO 
ONE ANOTHER WITHIN .004” AT THE SEATING PLANE. 



16-24 LEAD SMALL OUTLINE (GULL WING) 


| DWG # 

SOI 6-1 

S018-1 

S020-2 

S024-2 

S024-3 


16 (.300) 

18 (.300) 

20 (.300”) 


24 (.300”) 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.095 

.1043 

.095 

.1043 

.095 

.1043 

.095 

.1043 

.110 

.120 

A1 

.005 

.0118 

.005 

.0118 

.005 

.0118 

.005 

.0118 

.005 

.0118 

B 

.014 

.020 

.014 

.020 

.014 

.020 

.014 

.020 

.014 

.020 

C 

.0091 

.0125 

.0091 

.0125 

.0091 

0125 

.0091 

.0125 

.007 

.011 

D 

.403 

.413 

.447 

.462 

.497 

.511 

.600 

.614 

.620 

.630 

e 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC | 

E 

.292 

.2992 

.292 

.2992 

.292 

.2992 

.292 

.2992 

.295 

.305 

h 

.010 

.020 

.010 

.020 

.010 

.020 

.010 

.020 

.012 

.020 

H 

.400 

.419 

.400 

.419 

.400 

.419 

.400 

.419 

.406 

.419 

L 

.018 

.045 

.018 

.045 

.018 

.045 

.018 

.045 

.028 

.045 

a 

0* 

8* 

0* 

8* 

0' 

8* 

0* 

8' 

O' 

8* 

S 

.023 

.035 

.023 

.035 

.023 

.035 

.023 

.035 

.032 

.043 
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PACKAGE DIAGRAM OUTLINES 


SMALL OUTLINE IC (Continued) 


PIN 1 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN CENTERS. 

3. D & E DO NOT INCLUDE MOLD FLASH OR PROTRUSIONS 
AND TO BE MEASURED FROM THE BOTTOM OF THE PKG. 

4. FORMED LEADS SHALL BE PLANAR WITH RESPECT TO 
ONE ANOTHER WITHIN .004” AT THE SEATING PLANE. 



28 LEAD SMALL OUTLINE (GULL WING) 


DWG # 

S028-2 

S028-3 

# OF LDS (N) 

28 (.300”) 

28 (.330”) 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

A 

.095 

.1043 

.110 

.120 

A1 

.005 

.0118 

.005 

.014 

B 

.014 

.020 

.014 

.019 

C 

.0091 

.01 25 

.006 

.010 

D 

.700 

.712 

.718 

.728 

e 

.050 BSC 

.050 BSC | 

E 

.292 

.2992 

.340 

.350 

h 

.010 

.020 

.012 

.020 

H 

.400 

.419 

.462 

.478 

L 

.018 

.045 

.028 

.045 

a 

O' 

8* 

O' 

8' 

S 

.023 

.035 

.023 

.035 
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16-24 LEAD SMALL OUTLINE (EIAJ - .0315 PITCH) 



1 DWG # 1 

SOI 6—5 

HEMEBTOH 

16 

SYMBOLS 

MIN 

MAX 

A 

.057 

.071 

A1 

.002 

TYP 

B 

.012 

.020 

C 

.006 

.010 

D 

.248 

.271 

E 

.165 

.180 

e 

.0315 

CD 

00 

o 

H 

.232 

.256 

L 

.010 

- 

a 

0* 

8* 


S020-5 

20 

MIN MAX 
.069 .083 


.002 

TYP 

.012 

.020 

.006 

.010 

.331 

.354 

.205 

.220 

.0315 

- BSC 

.295 

.319 

.010 

- 

0‘ 

8* 


S024-5 

24 

MIN MAX~ 
.069 .083 

.002 TYP 


.012 

.020 

.006 

.010 

.382 

.405 

.205 

.220 

.0315 

CD 

CO 

o 

.295 

.319 

.010 

- 

0‘ 

8‘ 



16-28 LEAD SMALL OUTLINE (EIAJ - .050 PITCH) 







































































































































































































PACKAGE DIAGRAM OUTLINES 


SMALL OUTLINE IC (Continued) 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, 

UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN 
CENTERS. 

3. D1 & El DO NOT INCLUDE MOLD FLASH 
OR PROTRUSION AND TO BE MEASURED 
FROM THE BOTTOM OF THE PKG. 

4. FORMED LEADS SHALL BE PLANAR WITH 
RESPECT TO ONE ANOTHER WITHIN .004' 
AT THE SEATING PLANE 


— 

D1 ■ 

— — B (OPTIONAL) 

1 



1 f 

w 

B1 .025 Ml 

U« ; 

J — L SEATING PLANE 

N— J T 



16-24 LEAD SMALL OUTLINE (J-BEND) 


DWG # 

S016-2 

S020-1 

S024-4 

# OF LDS (N) 

16 LD (.300") 

20 LD (.300”) 

24 LD (.300") 

SYMBOLS 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.120 

.140 

.120 

.140 

.130 

.148 

A1 

.078 

.095 

.078 

.095 

.082 

.095 

B 

.020 

.024 

.020 

.024 

.026 

.032 

B1 

.014 

.020 

.014 

.020 

.015 

.020 

C 

.008 

.013 

.008 

.013 

.007 

.011 

D1 

.400 

.412 

.500 

.512 

.620 

.630 

E 

.335 

.347 

.335 

.347 

.335 

.345 

El 

.292 

.300 

.292 

.300 

.295 

.305 

E2 

.262 

.272 

.262 

.272 

.260 

.280 

e 

.050 BSC 

.050 BSC 

.050 BSC | 

h 

.010 

.020 

.010 

.020 

.010 

.020 

S 

.023 

.035 

.023 

.035 

.032 

.043 
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PACKAGE DIAGRAM OUTLINES 


SMALL OUTLINE IC (Continued) 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, 
UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING 
BETWEEN CENTERS. 

3. D1 & El DO NOT INCLUDE MOLD 
FLASH OR PROTRUSION AND TO 
BE MEASURED FROM THE BOTTOM 
OF THE PKG. 

4. FORMED LEADS SHALL BE PLANAR 
WITH RESPECT TO ONE ANOTHER 
WITHIN .004” AT THE SEATING 
PLANE. 


— 

h- 

D1 

-B (OPTIONAL) 





w 

B1 

Wit" 

.025 MIN— 1 ' 




28-32 LEAD SMALL OUTLINE (J-BEND) 


DWG # 

S028-5 

S028-4 

S032-2 

# OF LDS (N) 

28 LD (.300") 

28 LD (.350") 

32 LD (.300'') 

SYMBOLS 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.120 

.140 

.130 

.148 

.130 

.148 

A1 

.078 

.095 

.082 

.095 

.082 

.095 

B 

.020 

.024 

.026 

.032 

.026 

.032 

B1 

.014 

.020 

.016 

.020 

.016 

.020 

C 

.008 

.013 

.007 

.011 

.008 

.013 

D1 

.700 

.712 

.720 

.730 

.820 

.830 

E 

.335 

.347 

.380 

.390 

.330 

.340 

El 

.292 

.300 

.345 

.355 

.295 

.305 

E2 

.262 

.272 

.310 

.330 

.260 

.275 

e 

.050 BSC 

.050 BSC 

.050 BSC S 

h . 

.012 

.020 

.012 

.020 

.012 

.020 

S 

.023 

.035 

.023 

.035 

.032 

.043 
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PACKAGE DIAGRAM OUTLINES 


SMALL OUTLINE IC (Continued) 

48 & 56 LEAD SMALL OUTLINE (SSOP - GULL WING) 




NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS 
OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN 
CENTERS. 

3. D & E DO NOT INCLUDE MOLD FLASH OR 
PROTRUSIONS. 

4. FORMED LEADS SHALL BE PLANAR WITH 
RESPECT TO ONE ANOTHER WITHIN .004” 
AT THE SEATING PLANE. 



DWG # 

S048-1 

S056-1 

# OF LDS (N) 

48 (.300”) 

56 (.300”) 

SYMBOL 

MIN 

MAX 

MIN 

MAX 

A 

.095 

.110 

.095 

.110 

A1 

.008 

.016 

.008 

.016 

b 

.008 

.012 

.008 

.012 

C 

.005 

.009 

.005 

.009 

D 

.620 

.630 

.720 

.730 

E 

.291 

.299 

.291 

.299 

e 

.025 BSC 

.025 BSC | 

H 

.395 

.420 

.395 

.420 

h 

.015 

.025 

.015 

.025 

L 

.020 

.040 

.020 

.040 

a 

0* 

8* 

0* 

8* 
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PACKAGE DIAGRAM OUTLINES 


PLASTIC QUAD FLATPACKS 


100-132 LEAD PLASTIC QUAD FLATPACK (JEDEC) 



NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS 
OTHERWISE SPECIFIED. 

2. BSC - BASIC LEAD SPACING BETWEEN 
CENTERS. 

3. PIN 1 IDENTIFIER CAN BE POSITIONED AT 
EITHER ONE OF THESE TWO LOCATIONS. 

4. DIMENSIONS D1, D2, El, AND E2 DO NOT 
INCLUDE MOLD PROTRUSIONS. ALLOWABLE 
MOLD PROTRUSIONS ARE AS FOLLOWS: 

D1 & El = .010 MAX. 

D2 & E2 = .007 MAX. 

5. ND &c NE REPRESENT NUMBERS OF LEADS 
IN D & E DIRECTIONS RESPECTIVELY. 


B (OPTIONAL) 



SEATING PLANE 


.025 MIN 
(OPTIONAL) 



DWG * 

PQ100-1 

PQ132-1 

# OF LDS (N) 

100 

132 

SYMBOLS 

MIN 

MAX 

MIN 

MAX 

A 

.160 

.180 

.160 

.180 

A1 

.020 

.040 

.020 

.040 

B 

.008 

.016 

.008 

.016 

bl 

.008 

.012 

.008 

.012 

C 

.0055 

.008 

.0055 

.008 

D 

.875 

.885 

1.075 

1.085 

D1 

.747 

.753 

.947 

.953 

D2 

.897 

.903 

1.097 

1.103 

D3 

.600 REF 

.800 REF 

e 

.025 BSC 

.025 BSC 

E 

.875 

.885 

1.075 

1.085 

El 

.747 

.753 

.947 

.953 

E2 

.897 

.903 

1.097 

1.103 

E3 

.600 REF 

.800 REF 1 

L 

.020 

.030 

.020 

.030 

a 

O' 

8* 

0* 

8* 

ND/NE 

25/25 

33/33 1 
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PACKAGE DIAGRAM OUTLINES 


PLASTIC QUAD FLATPACKS (Continued) 


80-128 LEAD PLASTIC QUAD FLATPACK (EIAJ) 



NOTES: 


1 . 

2 . 

3. 

4. 


5 


ALL DIMENSIONS ARE IN INCHES, UNLESS 
OTHERWISE SPECIFIED. 

BSC - BASIC LEAD SPACING BETWEEN 
CENTERS. 

D1 & El DO NOT INCLUDE MOLD PROTRUSION. 
ALLOWABLE PROTRUSION IS .010 PER SIDE. 

ND & NE REPRESENT NUMBERS OF LEADS IN 
D & E DIRECTIONS RESPECTIVELY. 

THE 3.9mm FOOTPRINT IS STANDARD, HOWEVER 
THE 3.2mm IS OPTIONAL & CAN BE REQUESTED. 


»iYf wmm 

PQ80-2 

PQ100-2 

PQ120-2 

PQ128-2 

risldLklffl] 

80 

100 

120 

128 

SYMBOLS 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

1 A 

.110 

.124 

.110 

.124 

.136 

.156 

.136 

.156 

A1 

.010 

- 

.010 

- 

.010 

— 

.010 

— 

A2 

.100 

.120 

.100 

.120 

.125 

.144 

.125 

.144 

C 

.005 

.008 

.005 

.008 

.005 

.008 

.005 

.008 

D 

.937 

.945 

.937 

.945 

1.252 

1.260 

1.252 

1.260 

D1 

.783 

.791 

.783 

.791 

1.098 

1.106 

1.098 

1.106 

D3 

.724 REF 

.742 REF 

.913 REF 

.976 REF 1 

E 

.701 

.709 

.701 

.709 

1.252 

1.260 

1.252 

1.260 

El 

.547 

.555 

.547 

.555 

1.098 

1.106 

1.098 

1.106 

E3 

.472 REF 

.486 REF 

.913 REF 

.976 REF 

L 

.026 I .037 

.026 | .037 

.026 1 .037 

.026 | .037 


16/24 

20/30 

30/30 

32/32 

p 

.0315 BSC 

.026 BSC 

.026 BSC 

.0315 BSC 

w 

.012 1 .018 

.012 1 .018 

.012 | .018 

.012 1 .018 

ZD 

.032 

.023 

.094 

.063 

ZE 

.039 

.032 

.094 

.063 


1 ALT. 

D 

m 

.909 

.917 

.909 

.917 

1.224 

1.232 

1.224 

1.232 


11 

ihi 

.673 

.681 

.673 

.681 

1.224 

1.232 

1.224 

1.232 
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PACKAGE DIAGRAM OUTLINES 


PLASTIC QUAD FLATPACKS (Continued) 


144-208 LEAD PLASTIC QUAD FLATPACK (EIAJ) 









































































































































PLASTIC LEADED CHIP CARRIERS 


20-84 LEAD PL CC (SQUARE) 


45' x .045 




— L D3/E3 I 

“T I D2 / 

L-M I 


SEATING PLANE 


NOTES: 


ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 
BSC - BASIC LEAD SPACING BETWEEN CENTERS 
D & E DO NOT INCLUDE MOLD FLASH OR PROTRUSIONS. 

FORMED LEADS SHALL BE PLANAR WITH RESPECT TO ONE 
ANOTHER WITHIN .004” AT THE SEATING PLANE. 

ND & NE REPRESENT NUMBER OF LEADS IN D & E DIRECTIONS 
RESPECTIVELY. 

D1 & El SHOULD BE MEASURED FROM THE BOTTOM OF THE PKG. 


DWG 


# OF LDS 


SYMBOL 


A1 


B 


bl 


C 


Cl 


D 


D1 


D2/E2 


D3/E3 


E 


El 


e 


J20-1 

J28-1 

J44-1 

J52-1 

J68-1 

20 

28 

44 

52 

68 


I KngjKnrnKT3ilKnTnKT3ilKTiTnKngKf^in3ilK??fil i 

I jg^Ki wa Egu mi wmmm 111111111 | 

I mm KBI miHliVi i miH UiyH HUSSESI I 

WSEBIWiHS] BiEIHKSHa ■iwnKHil IwJilIBgffl 

WtfMWIM nlihiRUM nidnuei 


MIN 


.165 


.095 


.026 .032 


HIES?*] EBBS) Bfehlkamj EETniEKi] EEgHESfill 


.200 REF .300 REF .500 REF .600 REF .800 REFI 1.000 REF 


1.185 


.350 .356 




.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC 

.050 BSC 

5 

7 

11 

13 

17 
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PLASTIC LEADED CHIP CARRIERS (Continued) 


18-32 LEAD PLCC (RECTANGULAR) 

y-PIN 1 

45* X .045 -a j>£ 


El E 







D3 (ND) 

— D1 — 

— D — 


SEATING PLANE 




-bl I 
: D2 


OPTIONAL FEATURE 
ADHESIVE PEDESTAL 
(32 LD ONLY) 


DWG # 

# OF LDS 
SYMBOL 
A 
A1 

1 

bl 

C 

Cl 

02 
D 

m ~ 

D2 

D3 


J32-1 


1 

8 

32 

MIN 

MAX 

MIN 

MAX 

.120 

.140 

.120 

.140 

.075 

.095 

.075 

.095 

.026 

.032 

.026 

.032 

.013 

.021 

.013 

.021 

.015 

.040 

.015 

.040 

.008 

.012 

.008 

.012 

- 

- 

.005 

.015 

.320 

.335 

.485 

.495 

.289 

.293 

.449 

.453 

.225 

.265 

.390 

.430 


.150 REF 


.300 REF 


E 

.520 

.535 

.585 

.595 

El 

.489 

.493 

.549 

.553 

E2 

.422 

.465 

.490 

.530 

E3 

.200 

REF 

.400 

REF 

e 

.050 

BSC 

.050 

BSC 

ND/NE 

4/5 

7/9 | 


NOTES: 
1. ALL 


ALL DIMENSIONS ARE IN INCHES, UNLESS 
OTHERWISE SPECIFIED. 

BSC - BASIC LEAD SPACING BETWEEN 
CENTERS. 

D & E DO NOT INCLUDE MOLD FLASH OR 
PROTRUSIONS. 

FORMED LEADS SHALL BE PLANAR WITH 
RESPECT TO ONE ANOTHER WITHIN .004” 

AT THE SEATING PLANE. 

ND & NE REPRESENT NUMBERS OF LEADS IN 
D & E DIRECTIONS RESPECTIVELY. 

D1 & El SHOULD BE MEASURED FROM THE 
BOTTOM OF THE PACKAGE. 
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PACKAGE DIAGRAM OUTLINES 


PLASTIC PIN GRID ARRAYS 


68-208 PIN PGA (CAVITY UP) 



, 

11 




A 

SEATING PLANE -J- 

-il 

-1 

FI 

e 

tv n linn 

1— 0C — 

u 

t 


NOTES: 

1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE SPECIFIED. 

2. BSC - BASIC PIN SPACING BETWEEN CENTERS. 

3. SYMBOL ”M” REPRESENTS THE PGA MATRIX SIZE. 

4. SYMBOL ”N” REPRESENTS THE NUMBER OF PINS. 

5. DIM. ”A” INCLUDES BOTH THE PKG BODY & THE LID. IT DOES NOT INCLUDE HEATSINK OR 
OTHER ATTACHED FEATURES. 

6. PIN DIAMETER ”C” EXCLUDES SOLDER DIP OR OTHER LEAD FINISH. 

7. PIN TIPS MAY HAVE RADIUS OR CHAMFER. 


DWG No. 

PG 68-2 

PG 84-2 

PG 208-2 

Wm 8MJT3b«gW 

68 PIN 

84 PIN 

208 PIN 

SYMBOLS 

MIN 

MAX 

MIN 

MAX 

MIN 

MAX 

A 

.115 

.160 

.115 

.160 

.115 

.160 

C 

.016 

.020 

.016 

.020 

.016 

.020 

D 

1.140 

1.180 

1.140 

1.180 

1.740 

1.780 

D1 

1.000 BSC 

1.000 BSC 

1.600 BSC 

E 

1.140 

1.180 

1.140 

1.180 

1.740 

1.780 j 

El 

1.000 BSC 

1.000 BSC 

1.600 BSC 

e 

.100 BSC 

.100 BSC 

.100 BSC 

L 

.100 

.160 

.100 

.160 

.100 

.160 

M 

11 

_ 11 

17 

Q 

.040 

.070 

.040 

.070 

.040 

.070 | 
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IDT RISC PROCESSING COMPONENTS 

THE COMPLETE RISC SOLUTION 

Integrated DeviceTechnology, Inc. is dedicatedto providing 
complete RISC design solutions by combining expertise in 
silicon processes with leadership products in development 
systems and software. Long an industry leader in the fastest 
static RAMs and high-speed logic, IDT now offers RISC 
system building blocks comprised of components and board- 
level subsystems. 

As a semiconductorpartnerwith MIPS Computer Systems, 

IDT has established a leadership position in the RISC 
marketplace by supplying the fastest CPUs at 40MHz, 
pioneering RISC CPU Subsystem™ modules, and offering 
cost-effective development tools and software. 

The MIPS architecture has become an industry standard 
and has been adopted by over 1 00 leading OEM manufacturers 
including DEC, Sony, Tandem, NEC, CDC, Adobe, Siemens, 

Nixdorf, Honeywell Bull and Silicon Graphics. The MIPS ISA 
(Instruction Set Architecture) has been selected by JIAWG as 
the 32-bit microprocessor standard for military avionics. 

RISComponent™ FAMILY OVERVIEW 

The R3000 Family consists of the R3000 RISC CPU, the 
R3001 and R3051/52 RISControllers™, the R3010 Floating- 
Point Accelerator, the R3500 RISCore™ and the R3020 
Write Buffer. The R3000 processor is a derivative of the 
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Figure. 1. IDT79R3000 Processor 


R2000A, the first commercially available RISC processor 
introduced in 1985. The R3001 RISControllerandthe R3051 
family are versions of the processor tailored for embedded 
control and low-cost workstations. The R3500 integrates 
floating point capability onto the R3000 pinout. The R4000 is 
the third generation of the MIPS RISC architecture that sets a 
new performance standard for the 1990s. 


THE IDT79R3000 CPU 

The R3000 processor consists of two tightly-coupled 
processors implemented on a single chip. 

The first processor is a full 32-bit Harvard Architecture CPU 
consisting of 32 registers, an integer ALU, a single-cycle 
shifter and a multiplier/divider. The second processor is a 
system control coprocessor containing a Translation Look- 
aside Buffer (TLB) and control registers to support a virtual 
memory space of 4GBytes and separate Instruction and Data 
caches. 

The R3000 CPU features: 

• Full 32-bit operation 

• Three instruction formats 

• Efficient 5-stage pipeline 

• On-chip cache control 

• On-chip Memory Management Unit 

• Multiprocessor capability 
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Figure 2. IDT79R3010 Floating-Point Accelerator 


THE IDT79R3010 FLOATING-POINT 
ACCELERATOR 

The R3010 Floating-Point Accelerator (FPA) supports full 
conformance with the IEEE 754 floating-point specification. It 
acts as aco-processorto the R3000 CPU , providing a seamless 
integration of fixed and floating-point instructions. All floating- 
point operations are transparent to the programmer. 

The R3010 FPA features: 

• Full 64-bit operation 

• Single-cycle load/store instructions 

• Seamless interface to the R3000 or R3001 CPU 

• Three operation units (add/subtract, multiply and divide) 
can operate in parallel 

• Six-level pipeline 

THE IDT79R3001 RISController™ 

The R3001 RISController optimizes the high-performance 
MIPS architecture for embedded control systems. Capable of 
28 MIPS performance at 33MHz, the R3001 incorporates new 
features for real-time control. The controller extends the 
performance range of the current R3000 processor, saves 
valuable real estate for space-critical designs and lowers 
system memory costs. 

The MIPS performance range is extended by the increase 
in the R3000 in the synchronous memory space from 
512KBytes, maximum,, to a full 32MBytes. This allows the 
system to perform with a guaranteed “cache” hit rate of 1 00%. 
The on-chip memory controller allows the designer to use 
standard SRAMs, DRAMs, or even VRAMs, representing a 


significant cost savings over other solutions. The processor 
supports predictable interrupt response times for real-time 
control applications, and system chip count is lowered by 
substantially reducing the number of devices needed to 
implement local memory. 

THE R3051 FAMILY OF RISControllers™ 

The IDT79R3051 Family is a derivative of the R3000, 
featuring a high level of integration and targeted to high- 
performance but cost-sensitive embedded processing 
applications. The R3051 family is designed to bring the high- 
performance inherent in the MIPS RISC architecture into low- 
cost, simplified, power sensitive applications. 

Functional units were integrated onto the CPU core in order 
to reduce the total system cost rather than to increase the 
inherent performance of the integer engine. Thus, the R3051 
family is able to offer 35 MIPS of integer performance at 
40MHz without requiring external SRAM or caches. 

THE R3500A RISCore™ CPU/FPA 

The R3500A is a a single chip that integrates the R3000A 
CPU and the R301 0A FPA execution units using the R3000A 
packaging and pinout. This high integration device is 
completely binary software, compatible with the R3000, 
R2000 CPUs and R3010FPA to facilitate the migration 
path to higher performance and lower chip count systems 
that utilize both the CPU and the floating point units. 

IDT has also made several enhancements to the R3000 
architecture such as faster multiply and divide instructions and 
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added a programmable tag bus width allowing reduced cache 
cost. Thepowerconsumptionis lowerby33%whencompared 
to the standard R3000 and R3010. 

THE IDT79R4000 CPU 

The R4000 is the third generation of MIPS RISC technology 
and establishes a new performance standard for RISC 
processors forthe 1990s. The R4000 extends the performance 
range served by the MIPS architecture and, thereby, 
provides a migration path to applications served by the R3000, 
R3001 and the R3051 . 

This third generation processor maintains full binary 
compatibility with applications executing on the R2000/ 


R3000 and IDT's RISController family, while achieving 
substantially higherperformance. The key to this performance 
is both the architecture/implementation of the processor and 
the high level of integration achieved in a single chip. The 
R4000 contains the RISC integer unit, floating-point unit, 
MMU, 8K of I- and D-cache, along with multiprocessing 
support such as direct control of optional secondary caches. 
To achieve performance levels capable of over 50 VAXMIPS 
sustained performance, the R4000 utilizes technology such 
as super-pipelining to exploit 2 level instruction parallelism 
with no issue restrictions. The R4000 presents a balanced 
architectural approach to achieve a wide range of price 
performance goals. 
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IDT79R3000A 

IDT79R3000AE 



RISC CPU PROCESSOR 


FEATURES: 

• Enhanced instruction set compatible version of the 
IDT79R2000, IDT79R3000 RISC CPUs. 

• Upwardly pin-compatible with IDT79R3000 RISC CPU. 

• IDT79R3000A “E” version relaxes system memory timing 
requirements in a high-speed systems. 

• Full 32-bit Operation — Thirty-two 32-bit registers and all 
instructions and addresses are 32-bit. 

• Efficient Pipelining — The CPU’s 5-stage pipeline design 
assists in obtaining an execution rate approaching one 
instruction per cycle. Pipeline stalls and exceptions are 
handled precisely and efficiently. 

• On-Chip Cache Control — The IDT79R3000A provides a 
high bandwidth memory interface that handles separate 
external Instruction and Data Caches ranging in size from 
4 to 256 Kbytes each. Both caches are accessed during a 
single CPU cycle. All cache control is on-chip. 

• On-Chip Memory Management Unit— A fully-associative, 
64 entry Translation Lookaside Buffer (TLB) provides fast 
address translation for virtual-to-physical memory map- 
ping of the 4 Gigabyte virtual address space. 


• Dynamically able to switch between Big- and Little- Endian 
byte ordering conventions. 

Coprocessor Interface — The IDT79R3000A generates all 
addresses and handles memory interface control for up to 
three additional tightly coupled external processors. 
Optimizing Compilers are available for C, Fortran, Pascal, 
COBOL, Ada, and PU1. 

UNIX™ System V.3 and BSD 4.3 operating systems 
supported. 

High-speed CEMOS™ technology. 

16.7 through 40MHz clock rates yield up to 32 VUPS 
sustained throughput. 

Supports independent multiword block refill of both the 
instruction and data caches with variable block sizes. 
Supports concurrent refill and execution of instructions. 
Partial word stores executed as read-modify-write opera- 
tions. 

6 external interrupt inputs, 2 software interrupts, with single 
cycle latency to exception handler routine. 

Flexible multiprocessing support on chip with no impact on 
uniprocessor designs. 

• Military product compliant to MIL-STD-883, Class B. 


IDT79R3000A PROCESSOR 

CPO CONTROL CPU 
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DESCRIPTION: 

The IDT 79R3000A RISC Microprocessor consists of two 
tightly-coupled processors integrated on a single chip. The 
first processor is a full 32-bit CPU based on RISC (Reduced 
Instruction Set Computer) principles to achieve a new stan- 
dard of microprocessor performance. The second processor 
is a system control coprocessor, called CPO, containing a 
f ully-associative 64 entry TLB (T ranslation Lookaside Buffer) , 
MMU (Memory Management Unit) and control registers, sup- 
porting a 4 Gigabyte virtual memory subsystem, and a Harvard 
Architecture Cache Controller achieving a bandwidth of 320 
Mbytes/second using industry standard static RAMs. 

This data sheet provides an overview of the features and 
architecture of the 79R3000A CPU, Revision 3.0. A more 
detailed description of the operation of the device is incorpo- 
rated in the “R3000A Family Hardware User Manual", and a 
more detailed architectural overview is provided in the “mips 
RISC Architecture” book, both available from IDT. Documen- 
tation providing details of the software and development 
environments supporting this processor are also available 
from IDT. 

IDT79R3000A CPU Registers 

The IDT79R3000A CPU provides 32 general purpose 32- 
bit registers, a 32-bit Program Counter, and two 32-bit regis- 
ters that hold the results of integer multiply and divide opera- 
tions. Only two of the 32 general registers have a special 
purpose: register rO is hardwired to the value “0”, which is a 
useful constant, and register r31 is used as the link register in 
jump-and-link instructions (return addressforsubroutine calls). 

The CPU registers are shown in Figure 2. Note that there 
is no Program Status Word (PSW) register shown in this 
figure: the functions traditionally provided by a PSW register 
are instead provided in the Status and Cause registers incor- 
porated within the System Control Coprocessor (CPO). 


Instruction Set Overview 

All IDT79R3000A instructions are 32 bits long, and there 
are only three instruction formats. This approach simplifies 
instruction decoding, thus minimizing instruction execution 
time. The 79R3000A processor initiates a new instruction on 
every run cycle, and is able to complete an instruction on 
almost every clock cycle. The only exceptions are the Load 
instructions and Branch instructions, which each have a single 
cycle of latency associated with their execution. Note, how- 
ever, that in the majority of cases the compilers are able to fill 
these latency cycles with useful instructions which do not 
require the result of the previous instruction. This effectively 
eliminates these latency effects. 

The actual instruction set of the CPU was determined after 
extensive simulations to determine which instructions should 
be implemented in hardware, and which operations are best 
synthesized in software from other basic instructions. This 
methodology resulted in the R3000A having the highest 
performance of any available microprocessor. 
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Figure 3. IDT79R3000A Instruction Formats 


General Purpose Registers 
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Figure 2. IDT79R3000A CPU Registers 


The IDT79R3000A instruction set can be divided into the 

following groups: 

• Load/Store instructions move data between memory and 
general registers. They are all l-type instructions, since the 
only addressing mode supported is base register plus 16- 
bit, signed immediate offset. 

The Load instruction has a single cycle of latency, which 
means that the data being loaded is not available to the 
instruction immediately afterthe load instruction. The com- 
piler will fill this delay slot with either an instruction which is 
not dependent on the loaded data, or with a NOP instruc- 
tion. There is no latency associated with the store instruc- 
tion. 

Loads and Stores can be performed on byte, haff-word, 
word, or unaligned word data (32 bit data not aligned on a 
modulo-4 address). The CPU cache is constructed as a 
write-through cache. 

• Computational instructions perform arithmetic, logical 
and shift operations on values in registers. They occur in 
both R-type (both operands and the result are registers) 
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and l-type (one operand is a 16-bit immediate) formats. 
Note that computational instructions are three operand 
instructions; that is, the result of the operation can be 
stored into a different register than either of the two 
operands. This means that operands need not be overwrit- 
ten by arithmetic operations. This results in a more efficient 
use of the large register set. 

• Jump and Branch instructions change the control flow of 
a program. Jumps are always to a paged absolute address 
formed by combining a 26-bit target with four bits of the 
Program counter (J-type format, for subroutine calls), or 
32-bit register byte addresses (R-type, for returns and 
dispatches). Branches have 16-bit offsets relative to the 
program counter (l-type). Jump and Link instructions save 
a return address in Register 31 . The 79R3000A instruction 
set features a number of branch conditions. Included is the 
ability to compare a register to zero and branch, and also 
the ability to branch based on a comparison between two 


registers. Thus, net performance is increased since soft- 
ware does not have to perform arithmetic instructions prior 
to the branch to set up the branch conditions. 

• Coprocessor instructions perform operations in the 
coprocessors. Coprocessor Loads and Stores are l-type. 
Coprocessorcomputational instructions have coprocessor- 
dependent formats (see coprocessor manuals). 

• Coprocessor 0 instructions perform operations on the 
System Control Coprocessor (CPO) registers to manipu- 
late the memory management and exception handling 
facilities of the processor. 

• Special instructions perform a variety of tasks, including 
movement of data between special and general registers, 
system calls, and breakpoint. They are always R-type. 

Table 1 lists the instruction set of the IDT79R3000A 

processor. 
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OP 

Description 

OP 

Description 


Load/Store Instructions 


Multiply/Divide Instructions 

LB 

Load Byte 

MULT 

Multiply 

LBU 

Load Byte Unsigned 

MULTU 

Multiply Unsigned 

LH 

Load Halfword 

DIV 

Divide 

LHU 

Load Halfword Unsigned 

DIVU 

Divide Unsigned 

LW 

Load Word 

MFHI 

Move From HI 

LWL 

Load Word Left 

MTHI 

Move To HI 

LWR 

Load Word Right 

MFLO 

Move From LO 

SB 

Store Byte 

MTLO 

Move To LO 

SH 

Store Halfword 



SW 

Store Word 


Jump and Branch Instructions 

SWL 

Store Word Left 

J 

Jump 

SWR 

Store Word Right 

JAL 

Jump and Link 



JR 

Jump to Register 


Arithmetic Instructions 

JALR 

Jump and Link Register 


(ALU Immediate) 

BEQ 

Branch on Equal 

ADDI 

Add Immediate 

BNE 

Branch on Not Equal 

ADDIU 

Add Immediate Unsigned 

BLEZ 

Branch on Less than or Equal to Zero 

SLTI 

Set on Less Than Immediate 

BGTZ 

Branch on Greater Than Zero 

SLTIU 

Set on Less Than Immediate 

BLTZ 

Branch on Less Than Zero 


Unsigned 

BGEZ 

Branch on Greater than or 

ANDI 

AND Immediate 


Equal to Zero 

ORI 

OR Immediate 

BLTZAL 

Branch on Less Than Zero and Link 

XORI 

Exclusive OR Immediate 

BGEZAL 

Branch on Greater than or Equal to 

LUI 

Load Upper Immediate 


Zero and Link 




Special Instructions 


Arithmetic Instructions 

SYSCALL 

System Call 


(3-operand, register-type) 

BREAK 

Break 

ADD 

Add 



ADDU 

Add Unsigned 


Coprocessor instructions 

SUB 

Subtract 

LWCz 

Load Word from Coprocessor 

SUBU 

Subtract Unsigned 

SWCZ 

Store Word to Coprocessor 

SLT 

Set on Less Than 

MTCz 

Move To Coprocessor 



MFCZ 

Move From Coprocessor 

SLTU 

Set on Less Than Unsigned 

CTCz 

Move Control to Coprocessor 

AND 

AND 

CFCz 

Move Control From Coprocessor 

OR 

OR 

COPz 

Coprocessor Operation 

XOR 

Exclusive OR 

BCzT 

Branch on Coprocessor z True 

NOR 

NOR 

BCzF 

Branch on Coprocessor z False 


Shift Instructions 


System Control Coprocessor 

SLL 

Shift Left Logical 


(CPO) Instructions 

SRL 

Shift Right Logical 

MTCO 

Move To CPo 

SRA 

Shift Right Arithmetic 

MFCO 

Move From CPo 

SLLV 

Shift Left Logical Variable 

TLBR 

Read indexed TLB entry 

SRLV 

Shift Right Logical Variable 

TLBWI 

Write Indexed TLB entry 

SRAV 

Shift Right Arithmetic Variable 

TLBWR 

Write Random TLB entry 



TLBP 

Probe TLB for matching entry 



RFE 

Restore From Exception 
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Table 1. IDT79R3000A Instruction Summary 
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IDT79R3000A System Control Coprocessor (CPO) 

The IDT79R3000A can operate with up to four tightly- 
coupled coprocessors (designated CPO through CP3). The 
System Control Coprocessor (or CPO), is incorporated on the 
IDT79R3000 chip and supports the virtual memory system 
and exception handling functions of the IDT79R3000A. The 
virtual memory system is implemented using a Translation 
Lookaside Buffer and a group of programmable registers as 
shown in Figure 4. 

System Control Coprocessor (CPO) Registers 

The CPO registers shown in Figure 4 are used to control 
the memory management and exception handling capabilities 
of the IDT79R3000A. Table 2 provides a brief description of 
each register. 


Register 

Description 

EntryHi 

High half of a TLB entry 

EntryLo 

Low half of a TLB entry 

Index 

Programmable pointer into TLB array 

Random 

Pseudo-random pointer into TLB array 

Status 

Mode, interrupt enables, and diagnostic status info 

Cause 

Indicates nature of last exception 

EPC 

Exception Program Counter 

Context 

Pointer into kernel’s virtual Page Table Entry array 

BadVA 

Most recent bad virtual address 

PRId 

Processor revision identification (Read only) 
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Table 2. System Control Coprocessor (CPO) Registers 


SYSTEM COPROCESSOR 




| | Used with Virtual Memory System 

[Til Used with Exception Processing 
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Figure 4. The System Coprocessor Registers 
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Memory Management System 

The IDT79R3000A has an addressing range of 4 Gbytes. 
However, since most IDT79R3000A systems implement a 
physical memory smaller than 4Gbytes, the IDT79R3000A 
provides for the logical expansion of memory space by trans- 
lating addresses composed in a large virtual address space 
into available physical memory address. The 4 GByte address 
space isdivided into 2 GBytes which can be accessed by both 
the users and the kernel, and 2 GBytes for the kernel only. 

The TLB (Translation Lookaside Buffer) 

Virtual memory mapping is assisted by the Translation 
Lookaside Buffer (TLB). The on-chip TLB provides very fast 
virtual memory access and is well-matched to the require- 
ments of multi-tasking operating systems. The fully-associa- 


tive TLB contains 64 entries, each of which maps a 4-Kbyte 
page, with controls for read/write access, cacheability, and 
process identification. The TLB allows each user to access up 
to 2 Gbytes of virtual address space. 

Figure 5 illustrates the format of each TLB entry. The 
Translation operation involves matching the current Process 
ID(PID) and upper 20 bits of the address against PI D and VPN 
(Virtual Page Number) fields in the TLB. When both match (or 
the TLB entry is Global), the VPN is replaced with the PFN 
(Physical Frame Number) to form the physical address. 

TLB misses are handled in software, with the entry to be re- 
placed determined by as imple RANDOM function. The rou- 
tine to process a TLB miss in the UNIX environment requires 
only 1 0-1 2 cycles, which compares favorably with many CPUs 
which perform the operation in hardware. 


TLB ENTRY FORMAT 
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ENTRYHI ENTRYLO 


VPN - Virtual Page Number 
TLBPID - Process ID 
PFN - Physical Frame Number 
N - Non-cacheable flag 
D - Dirty flag (Write protect) 

V - Valid entry flag 
G - Global flag (ignore PID) 

0 - Reserved 


Figure 5. TLB Entry Format 
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IDT79R3000A Operating Modes 

The IDT79R3000A has two operating modes: User mode 
and Kernel/mode. The IDT79R3000A normally operates in 
the User mode until an exception is detected forcing it into the 
Kernel mode. It remains in the Kernel mode until a Restore 


From Exception (RFE) instruction is executed. The manner in 
which memory addresses are translated or mapped depends 
on the operating mode of the IDT79R3000A. Figure 6 shows 
the MMU translation performed for each of the operating 
modes. 
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MMU ADDRESS TRANSLATION 
VIRTUAL -> PHYSICAL 


OxFFFFFFFF 


OxCOOOOOOO 


OxAOOOOOOO 


0x80000000 

0X7FFFFFFF 


KERNEL 
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OxFFFFFFFF 
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MAPPED 

CACHEABLE 

(kseg2) 

any\ 







KERNEL 

UNMAPPED 




UNCACHED 

(ksegl) 



PHYSICAL 

MEMORY 

> 

KERNEL 

UNMAPPED 





CACHED 

(ksegO) 








0x20000000 

< 

KERNEL/USER 

N 


OxlFFFFFFF 

MEMORY 

0x00000000 

MAPPED 

CACHEABLE 

(kuseg) 

ANY \ 



V 



> 3584 MB 


y 512 MB 
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Figure 6. IDT79R3000A Virtual Address Mapping 


User Mode — in this mode, a single, uniform virtual ad- 
dress space (kuseg) of 2 Gbyte is available. Each virtual 
address is extended with a 6-bit process identifierfield to form 
unique virtual addresses. All references to this segment are 
mapped through the TLB. Use of the cache for up to 64 
processes is determined by bit settings for each page within 
the TLB entries. 

Kernel Mode — four separate segments are defined in 
this mode: 

• kuseg —' when in the kernel mode, references to this seg- 
ment are treated just like user mode references, thus 
streamlining kernel access to user data. 

• ksegO — references to this 512 Mbyte segment use cache 
memory but are not mapped through the TLB. Instead, they 
always map to the first 0.5 GBytes of physical address 
space. 

• ksegl — references to this 512 Mbyte segment are not 
mapped through the TLB and do not use the cache. 
Instead, they are hard-mapped into the same 0.5 GByte 
segment of physical address space as ksegO. 

• kseg2— references to this 1 Gbyte segment are always 
mapped through the TLB and use of the cache is deter- 
mined by bit settings within the TLB entries. 


IDT79R3000 Pipeline Architecture 

The execution of a single IDT79R3000A instruction con- 
sists of five primary steps: 


1) IF — Fetch the instruction (1-Cache). 

2) RD — Read any required operands from CPU 

registers while decoding the instruction. 

3) ALU — Perform the required operation on 

instruction operands. 

4) MEM — Access memory (D-Cache). 

5) WB — Write back results to register file. 

Each of these steps requires approximately one CPU 
cycle as shown in Figure 7 (parts of some operations overlap 
into another cycle while other operations require only 1/2 
cycle). 


IF 

RD 

ALU 



l-CACHE 

RF 

OP 

D-CACHE 


1 1 



1 


One Cycle 


Figure 7. IDT79R3000A Instruction Pipeline 
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INSTRUCTION EXECUTION 

The IDT79R3000A uses a 5-stage pipeline to achieve an 
instruction execution rate approaching one instruction per 
CPU cycle. Thus, execution of five instructions at a time are 
overlapped as shown in Figure 8. 


(5-Deep) 


1 IF 

RD 

ALU 

MEM 










IF 

R D 

ALU 

MEM 

WB 





IF 

FID 

ALU 

MEM 

WB 


/ 1 




1 



Instruction 1 

□a... 

IF 

R D 

ALU 

MEM 

WB 




1 



IF 

R D 

ALU 

MEM 

WB 




Current 

CPU 

Cycle 
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Figure 8. IDT79R3000A Execution Sequence 


IDT79R3000A INSTRUCTION PIPELINE 

This pipeline operates efficiently because different CPU 
resources (address and data bus accesses, ALU operations, 
register accesses, and so on) are utilized on a non-interfering 
basis. 

Memory System Hierarchy 

The high performance capabilities of the IDT79R3000A 
processor demand system configurations incorporating tech- 
niques frequently employed in large, mainframe computers 
but seldom encountered in systems based on more traditional 
microprocessors. 

A primary goal of systems employing RISC techniques is 
to minimize the average number of cycles each instruction 
requires for execution. In order to achieve this goal, RISC 
processors incorporate a numberof RISC techniques, includ- 
ing a compact and uniform instruction set, a deep instruction 
pipeline (as described above), and utilization of optimizing 
compilers. Many of the advantages obtained from these 
techniques can, however, be negated by an inefficient memory 
system. 

Figure 9 illustrates memory in a simple microprocessor 
system. In this system, the CPU outputs addresses to memory 
and reads instructions and data from memory or writes data to 
memory. The address space is completely undifferentiated: 
instructions, data, and I/O devices are all treated the same. In 
such a system, a primary limiting performance factor is memory 
bandwidth. 
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Figure 9. A Simple Microprocessor Memory System 


Figure 10 illustrates a memory system that supports the 
significantly greater memory bandwidth required to take full 
advantage of the IDT79R3000A’s performance capabilities. 
The key features of this system are: 
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Figure 10. An IDT79R3000A System with a 
High-Performance Memory System 
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• External Cache Memory — Local, high-speed memory 
(called cache memory) is used to hold instructions and data 
that is repetitively accessed by the CPU (for example, 
within a program loop) and thus reduces the number of 
references that must be made to the slower-speed main 
memory. Some microprocessors provide a limited amount 
of cache memory on the CPU chip itself. The external 
caches supported by the IDT79R3000A can be much 
larger; while a small cache can improve performance of 
some programs, significant improvements for a wide range 
of programs require large caches. 

• Separate Caches for data and Instructions— Even with 
high-speed caches, memory speed can still be a limiting 
factor because of the fast cycle time of a high-performance 
microprocessor. The IDT79R3000A supports separate 
caches for instructions and data and alternates accesses 
of the two caches during each CPU cycle. Thus, the 
processor can obtain data and instructions at the cycle rate 
of the CPU using caches constructed with commercially 
available IDT static RAM devices. 

In order to maximize bandwidth in the cache while minimiz- 
ing the requirement for SRAM access speed, the R3000A 
divides a single-processor clock cycle into two phases. 
During one phase, the address for the data cache access 
is presented while data previously addressed in the in- 
struction cache is read; during the next phase, the data 
operation is completed while the instruction cache is being 
addressed. Thus, both caches are read in a single proces- 
sor cycle using only one set of address and data pins. 

• Write Buffer — in orderto ensure data consistency, all data 
that is written to the data cache must also be written out to 
main memory. The cache write model used by the 
IDT79R3000A is that of a write-through cache; that is, all 
data written by the CPU is immediately written into the main 
memory. To relieve the CPU of this responsibility (and the 
inherent performance burden) the IDT79R3000A supports 
an interface to a write buffer. The I DT79R3020 Write Buffer 
captures data (and associated addresses) output by the 
CPU and ensures that the data is passed on to main 
memory. 

IDT79R3000A Processor Subsystem Interfaces 

Figure 1 1 illustrates the three subsystem interfaces pro- 
vided by the IDT79R3000A processor: 

• Cache control interface (on-chip) for separate data and 
instruction caches permits implementationofoff-chipcaches 
using standard IDT SRAM devices. The 79R3000Adirectly 
controls the cache memory with a minimum of external 
components. Both the instruction and data cache can vary 
from 0 to 256K Bytes (64K entries). The 79R3000A also 
includes the TAG control logic which determines whether 
or not the entry read from the cache is the desired data. The 
79R3000A cache controller implements a direct mapped 
cache for. high net performance (bandwidth). It has the 
ability to refill multiple words when a cache miss occurs, 
thus reducing the effective miss rate to less than 2% for 


large caches. When a cache miss occurs, the 79R3000A 
can support refilling the cache in 1, 4, 8, 16, or 32 word 
blocks to minimize the effective penalty of having to access 
main memory. The 79R3000A also incorporates the ability 
to perform instruction streaming; while the cache is refill- 
ing, the processor can resume execution once the missed 
word is obtained from main memory. In this way, the 
processor can continue to execute concurrently with the 
cache block refill. 

• Memory controller interface for system (main) memory. 
This interface also includes the logic and signals to allow 
operation with a write buffer to further improve memory 
bandwidth. In addition to the standard full word access, the 
memory controller supports the ability to write bytes and 
half-words by using partial word operations. The memory 
controller also supports the ability to retry memory ac- 
cesses if, for example, the data returned from memory is 
invalid and a bus error needs to be signalled. 

• Coprocessor Interface — The IDT79R3000A features a 
tightly coupled co-processor interface in which all co- 
processors maintain synchronization with the main pro- 
cessor; reside on the same data bus as the main proces- 
sor; and participate in bus transactions in an identical 
manner to the main processor. The IDT79R3000A gener- 
ates all required cache and memory control signals, includ- 
ing cache and memory addresses for attached 
coprocessors. As a result, only the data bus and a few 
control signals need to be connected to a coprocessor. 
The interface supports three types of coprocessor instruc- 
tions: loads/stores, coprocessor operations, and proces- 
sor-coprocessor transfers. Note that coprocessor loads 
and stores occur directly between the coprocessor and 
memory, without requiring the data to go through the CPU . 
Synchronization between the CPU and external 
coprocessors is achieved using a Phased-Lock Loop inter- 
face to the coprocessor. The coprocessor physical inter- 
face also includes coprocessor condition signals 
(CpCond(n)), which are used in coprocessor branch in- 
structions, and a coprocessor busy signal (CpBusy) which 
is used to stall the CPU if the coprocessor needs to hold off 
subsequent operations. 

Finally, a precise exception interface is defined between 
the CPU and coprocessors using the external interrupt 
inputs of the CPU. This allows a coprocessor exception, 
even if it was the result ot a multi-cycle operation, to be 
traced to the precise coprocessor operation which caused 
it. This is an important feature for languages which can 
define specific error handlers for each task. 

The interface supports up to four separate coprocessors. 
Coprocessor 0 is defined to be the system control 
coprocessor, and resides on the same chip as the CPU 
unit. Coprocessor 1 is the Floating Point Accelerator, IDT 
79R301 OA. Coprocessors 2 and 3 are available to support 
an interface to application specific functions. 
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MULTIPROCESSING SUPPORT 

The IDT79R3000A supports multiprocessing applications 
in a simple but effective way. Multiprocessing applications 
require cache coherency across the multiple processors. The 
I DT79R3000A offers two signals to support cache coherency: 
the first, MPStall, stalls the processor within two cycles of 
being received and keeps it from accessing the cache. This 
allows an external agent to snoop into the processor data 
cache. The second signal, MPInvalidate, causes the proces- 
sor to write data on the data cache bus which indicates the 
externally addressed cache entry is invalid. Thus, a subse- 
quent access to that location would result in a cache miss, and 
the data would be obtained from main memory. 

The two MP signals would be generated by a external 
logic which utilizes a secondary cache to perform bus snoop- 
ing functions. The 79R3000Adoes not impose an architecture 
for this secondary cache, but rather is flexible enough to 
support a variety of application specif ic architecture stand still 
maintain cache coherency. Further, there is no impact on 
designs which do not require this feature. The 79R3000A has 
further improved on the microprocessor support found in the 
79R3000, by allowing the use of cache RAMs with internal 
address latches in multiprocessor systems. 

ADVANCED FEATURES 

The IDT79R3000A offers a number of additional features 
such as the ability to swap the instruction and data caches, 
facilitating diagnostics and cache flushing. Another feature 
isolates the caches, which forces cache hits to occur regard- 
less of the contents of the tagf ields. The I DT79R3000A allows 
the processor to execute user tasks of the opposite byte 
ordering (endianness) of the operating system, and further 
allows parity checking to be disabled. More details on these 
features can be found in the IDT 79R3000A Family Hardware 
User’s Manual. 

Further features of the IDT79R3000A are configured 
during the last four cycles prior to the negation of the RESET 
input. These functions include the ability to select cache sizes 
and cache refill block sizes; the ability to utilize the multipro- 
cessor interface; whether or not instruction streaming is 
enabled; whether byte ordering follows “Big-Endian” or “Little- 
Endian” protocols, etc. Table 3 shows the configuration op- 
tions selected at Reset. These are further discussed in the 
“Hardware User’s Manual”. 

BACKWARD COMPATIBILITY WITH 79R2000 

The IDT79R3000A can be used in sockets designed for 
the 79R3000. The pin-out of the 79R3000A has been selected 
to ensure this compatibility, with new functions mapped onto 
previously unused pins. The instruction set is compatible with 
that of the 79R2000 at the binary level. As a result, code 
written forthe older processorcan be executed. New features 
can be selectively disabled. 

In most 79R3000 applications, the 79R3000A can be 
placed in the socket with no modification to initialization 
settings. Further application assistance on this topic is avail- 
able from IDT. 
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PACKAGE THERMAL SPECIFICATIONS 

The IDT79R3000A utilizes special packaging techniques 
to improve both the thermal and electrical characteristics of 
the microprocessor. 

In order to improve the electrical characteristics of the 
device, the package is constructed using multiple signal 
planes, including individual power planes and ground planes 
to reduce noise associated with high-frequency TTL parts. In 
addition, the 175-pin PGA package utilizes extra power and 
ground pins to reduce the inductance from the internal power 
planes to the power planes of the PC Board. 

In order to improve the electrical characteristics of the 
microprocessor, the device is housed using cavity down 
packaging. In addition, these packages incorporate a copper- 
tungsten thermal slug designed to efficiently transfer heat 
from the die to the case of the package, and thus effectively 
lower the thermal resistance of the package. The use of an 
additional external heat sink affixed to the package thermal 
slug further decreases the effective thermal resistance of the 
package. 

The case temperature may be measured in any environ- 
ment to determine whether the device is within the specified 
operating range. The case temperature should be measured 
at the center of the top surface opposite the package cavity 
(the package cavity is the side where the package lid is 
mounted). 

The equivalent allowable ambient temperature, Ta, can 
be calculated using the thermal resistance from case to 
ambient (Oca) forthe given package. The following equation 
relates ambient and case temperature; 

Ta = Tc - P*0ca 

where Pis the maximum powerconsumption, calculated by 
using the maximum Icc from the DC Electrical Characteristics 
section. 

Typical values for Oca at various airflows are shown in 
table 4 for the various CPU packages. 



Airflow - (ft/min) 

0 

200 

400 

600 

800 

1000 

0ca (175-PGA, 
144-PGA) 

21 

7 

3 

2 

1 

0.5 

0ca (172 Quad 
Flatpack) 

23 

9 

4 

3 

2.5 

1.5 
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Table 4. R3000A Package Characteristics 
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Input 

W Cycle 

X Cycle 

Y Cycle 

Z Cycle 

Into 

DBIkSizeO 

DBIkSizel 

Extend Cache 

Big Endian 

Inti 

IBIkSizeO 

IBIkSizel 
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NoCache 
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Reserved^ 1 ) 
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Multiprocessor 

BusDriveOn 

Int4 

PhaseDelayOn< 2 ) 

PhaseDelayOn' 2 > 

PhaseDeiayOn* 2 ) 

PhaseDelayOnt 2 ) 

Int5 

R3000 Mode* 2 ) 

R3000 Mode< 2 > 

R3000 Mode< 2 > 

R3000 Mode< 2 > 


NOTES: 2860 11)1 04 

1 . Reserved entries must be driven high. 

2. These values must be driven stable throughout the entire RESET period. 


Table 3. R3000A Mode Selectable Features 




Figure 11. IDT79R3000A Subsystem Interfaces Example; 64 KB Caches 
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PIN CONFIGURATION 
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172-Pln Flatpack(Top View) 


NOTES: 

1. Reserved pins must be connected. 

2. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 17: MP Stall, CpCond (3). 
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PIN CONFIGURATION 
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175-Pin PGA (Top View) 


NOTE: 

1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 17: MP Stall, CpCond (3). 
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PIN CONFIGURATION 
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144-Pln PGA (Top View) 

NOTE: 

1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate. CpCond (2). 

AdrLo 1 7: MP Stall. CpCond (3). 
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PIN CONFIGURATION 
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Gnd 
Gnd 
Reset 
BusError 
RdBusv 
WrBusy 
CpBusy 


W4) 

M3) 

M2) 

intm 

lnt(6) 

Vcc 

AdrLo(1 7) 
Vcc 

AdrLo(16) 

Vcc 

CpCond(l) 
CpCond (0) 
Gnd 

AdrLo(15) 

Vcc 

AdrLo(14) 

Vcc 

AdrLo(13) 

Vcc 

Vcc 

AdrLo(12) 

Gnd 

AdrLo(ll) 

Gnd 

AdrLo(IO) 

Gnd 

AdrLo(9) 

AdrLo(8) 

AdrLo(7) 

AdrLo(6) 

AdrLo(5) 

Vcc 


NOTE: 

1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 1 7: MP Stall, CpCond (3). 
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I DT79 R3000A/A E RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN DESCRIPTIONS 


Pin Name 

B3 

Description 

Data (0-31) 

I/O 

A 32-bit bus used for all instruction and data transmission among the processor, caches, memory interface, 
and coprocessors. 

DataP (0-3) 


A 4-bit bus containing even parity over the data bus. 

Tag (12-31) 

I/O 

A 20-bit bus used for transferring cache tags and high addresses between the processor, caches, and memory 
interface. 

TagV 


The tag validity indicator. 

Tag P (0-2) 

I/O 

A 3-bit bus containing even parity over the concatenation of TagV and Tag. 

AdrLo (0-17) 

o 

An 18-bit bus containing byte addresses used for transferring low addresses from the processor to the caches 
and memory interface. (AdrLo 16: CpCond (2), AdrLo 17: CpCond (3) set by reset initialization). 

IRdT 

0 

Read enable for the instruction cache. 

IWrT 

0 

Write enable for the instructon cache. 

]Rd2 

o 

An identical copy of IRdT used to split the load. 

1W?2 

o 

An identical copy of IWrl used to split the load. 

ICIk 

o 

The instruction cache address latch clock. This clock runs continuously. 

DRdl 

o 

The read enable for the data cache. 

DWrl 

o 

The write enable for the data cache. 

DRd2 

o 

An identical copy of DRdl used to split the load. 

DWr2 

o 

An identical copy of DWrl used to split the load. 

DCIk 

O 

The data cache address latch clock. This clock runs continuously. 

XEn 

o 

The read enable for the Read Buffer. 

AccTyp(0-2) 

o 

A 3-bit bus used to indicate the size of data being transferred on the data bus, whether or not a data transfer is 
occurring, and the purpose of the transfer. 

MemWr 

o 

Signals the occurrence of a main memory write. 

MemRd 

o 

Signals the occurrence of a main memory read. 

BusError 

1 

Signals the occurrence of a bus error during a main memory read or write. 

Run 

0 

Indicates whether the processor is in the run or stall state. 

Exception 

o 

Indicates that the instruction about to commit state should be aborted and other exception related information. 

SysOut 

o 

A reflection of the internal processor clock used to generate the system clock. 

CpSync 

o 

A clock which is identical to SysOut and used by coprocessors for timing synchronization with the CPU. 

RdBusy 

1 

The main memory read stall termination signal. In most system designs RdBusy is normally asserted and is 
deasserted only to indicate the successful completion of a memory read. RdBusy is sampled by the processor 
only during memory read stalls. 

WrBusy 

1 

The main memory write stall initiation/termination signal. 

CpBusy 

1 

The coprocessor busy stall initiation/termination signal. 

CpCond (0-1) 

1 

A 2-bit bus used to transfer conditional branch status from the coprocessors to the main processor. 

CpCond (2-3) 

1 

Conditional branch status from coprocessors to the processor. Function is provided on AdrLo 16/17 pins and is 
selected at reset time. 

MPStall 

1 

Multiprocessing Stall. Signals to the processor that it should stall accesses to the caches in a multiprocessing 
environment. This is physically the same pin as CpCond3; its use is determined at RESET initialization. 

MPInvalidate 

■ 

Multiprocessing Invalidate. Signals to the processor that it should issue invalidate data on the cache data bus. 
The address to be invalidated is externally provided. This is the same pin as CpCond2; its use is determined at 
RESET initialization. 

TrrT (0-5) 

1 

A 6-bit bus used by the memory interface and coprocessors to signal maskable interrupts to the processor. At 
reset time, mode select values are read in. 














































































IDT79R3000A/AE RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN DESCRIPTIONS (Continued) 


Pin Name 

| g 

Description 

Clk2xSys 

i 

The master double frequency input clock used for generating SysOut. 

Clk2xSmp 

i 

A double frequency clock input used to determine the sample point for data coming into the processor and 
coprocessors. 

Clk2xRd 

i 

A double frequency clock input used to determine the enable time of the cache RAMs. 

Clk2xPhi 

i 

A double frequency clock input used to determine the position of the internal phases, phasel and phase2. 

Reset 

■ 

Synchronous initialization input used to force execution starling from the reset memory address. Reset must be 
deasseted synchronously but asserted asynchronously. The deassertion of Reset must be synchronized by the 
leading edge of SysOut. 
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ABSOLUTE MAXIMUM RATINGS 0 * 3) 



Rating 

Commercial 

Military 

fflifil 

Vterm 

Terminal Voltage 
with Respect 
toGND 

-0.5 to +7.0 

-0.5 to +7.0 

V 

Ta.Tc 

Operating 

Temperature 

0 to +70 (4) 
(Ambient) 

0 to +90< 5 > 
(Case) 

-55 to +125 
(Case) 

1 

Tbias 

Case Temperature 
Under Bias 

-55 to +125 (4) 
0 to +90< 5 > 

—65 to +135 

■ 

Tstg 

Storage 

Temperature 

-55 to +125 

-65 to +155 

°C 

IlN 

Input Voltage 

-0.5 to +7.0 

-0.5 to +7.0 

V 
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NOTE: 

1 . Stresses greater than those listed under ABSOLUTE MAXI MU M RATINGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. ViN minimum = -3.0V for pulse width less than 15ns. 

Vin should not exceed Vcc +0.5 Volts. 

3. Not more than one output should be shorted at a time. Duration of the short 
should not exceed 30 seconds. 

4. 16-33 MHz only. 

5. 37-40 MHz only. 


AC TEST CONDITIONS 


Symbol 

Parameter 

Min. 

Max. 

Unit 

VlH 

Input HIGH Voltage 

3.0 

— 

V 

VlL 

Input LOW Voltage 

— 

0.4 

V 

VlHS 

Input HIGH Voltage 

3.5 

— 

V 

VlLS 

Input LOW Voltage 

— 

0.4 

V 


RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


Grade 

Temperature 

GND 

Vcc 

Military 

16-33 MHZ 

-55°C to +125°C 
(Case) 

OV 

5.0 ±10% 

Commercial 
16-33 MHz 

0°C to +70°C 
(Ambient) 

ov 

5.0 ±5% 

Commercial 
37-40 MHz 

0°C to +90°C 
(Case) 

ov 

5.0 ±5% 
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OUTPUT LOADING FOR AC TESTING 
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IDT79R3000A/AE RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS— 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V±5%) 


Symbol 

Parameter 

Test Conditions 

79R3000A 

79R3000AE 

Unit 

16.67MHz 

20.0MHz 

25.0MHz 

33.33MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

— 

3.5 

— 

D 

VOL 

Output LOW Voltage 

Vcc « Min., Iol - 4mA 

— . 

ESI 

— 

0.4 

— 

0.4 

— . 

0.4 

D 

Vohc 

Output HIGH Voltage* 7 * 

Vcc = Min., Ioh = -4mA 

ESI 

— 


— 

4.0 

— 

ESI 

— 

D 

Voht 

Output HIGH Voltage* 4 6 * 

Vcc - Min., Ioh = -8mA 

m 

— 

El 

— 

El 

— 

El 

— 

O 

Volt 

Output LOW Voltage* 4 - 6 * 

Vcc - Min., Iol = 8mA 

— 

0.8 

— 

0.8 

— 

0.8 

— 

0.8 

D 

Vih 

Input HIGH Voltage* 5 * 


2.0 

— 

2.0 

— 

2.0 

— 

2.0 

— 

D 

Vil 

Input LOW Voltage* 1 * 


— 

0.8 

— 

0.8 

— 

0.8 

— 

0.8 

El 

Vihs 

Input HIGH Voltage* 2 - 5 * 


3.0 

— 


— 


— 

3.0 

— 


Vils 

Input LOW Voltage* 1 - 2 * 


— 

ESI 

— 

0.4 

— 

0.4 

— 

0.4 

m 

ClN 

Input Capacitance* 6 * 


— 

10 

— 

10 

— 

10 

— 

10 

m 

Cout 

Output Capacitance* 6 * 


— 

10 

— 

10 

— 

10 

— 

10 

El 

Icc 

Operating Current 

Vcc = 5V, Ta = 70°C 

— 

450 

— 

550 

— 

650 

— 

750 

mA 

IlH 

Input HIGH Leakage* 3 * 

Vih = VCC 

— 

100 

— 

100 

— 

100 

— 

100 

I2J 

IlL 

Input LOW Leakage* 3 * 

Vil = GND 

-100 

— 

-100 


-100 

— 

-100 

— 


loz 

Output Tri-state Leakage 

Voh = VCC, Vol = GND 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

BO 
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NOTES: 

1 . Vil Min. = -3.0V for pulse width less than 1 5ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vus apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag busses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 
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1DT79R3000A/AE RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS— 

MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V±10%) 


Symbol 

Parameter 

Test Conditions 

79R3000A 

79R3000AE 

Unit 

16.67MHz 

20.0MHz 

25.0MHz 

33.33MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

— 

3<5 

— 

D 

VOL 

Output LOW Voltage 

Vcc = Min., Iol = 4mA 

— 


— 

E 

— 

0.4 


0.4 

ni 

Vohc 

Output HIGH Voltage* 7 ) 

Vcc - Min., Ioh = -4mA 

4.0 

— 

ESI 

— 

4.0 

Tif 

4P, 

— 

D 

Voht 

Output HIGH Voltage* 4 - 6 ) 

Vcc * Min., Ioh = -8mA 

m 

— 

El 

— 

El 


El 

— 

D 

Volt 

Output LOW Voltage* 4 - 6 ) 

Vcc = Min., Iol = 8mA 

— 

0.8 

— 

0.8 

— % 

0.8 

— 

0.8 

D 

Vih 

Input HIGH Voltage* 5 ) 


2.0 

— 

wrm 

— 


— 

2.0 

— 


Vil 

Input LOW Voltage* 1 ) 


— 

0.8 

— 

0.8 i 

— 

* 0.8 

— 

0.8 

D 

Vihs 

Input HIGH Voltage* 2 - 5 ) 


3.0 

— 

ill 


3.0 

— 

3.0 

— 

D 

Vils 

Input LOW Voltage* 1 - 2 ) 


— 

0.4 



!•_ 

0.4 

— 

0.4 


ClN 

Input Capacitance* 6 ) 


— 

10 



— 

10 

— 

10 


Cout 

Output Capacitance* 6 ) 


— 

10 

— 

#10 

— 

10 

— 

10 


Icc 

Operating Current 

Vcc = 5V, Ta = 70°C 

— 

500- 

./■“V 

600 

— 

650 

— 

750 


IlH 

Input HIGH Leakage* 3 ) 

Vih = VCC 

— 

f 

ipjll 

%- 

100 

— 

100 

— 

100 

Q| 

IlL 

Input LOW Leakage* 3 ) 

Vil = GND 

-100 


-100 

— 

-100 

— 

-100 

— 


loz 

Output Tri-state Leakage 

Voh = VCC, Vol = GND 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

M 
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NOTES: 

1 . Vil Min. = -3.0V for pulse width less than 1 5ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vus apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag busses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 


















































































































IDT79R3000A/AE RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS— 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V ±5%) 


Symbol 

Parameter 

Test Conditions 

79R3000AE 

Unit 

37.0MHz 

40.0MHz 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc - Min., Ioh = -4mA 

3.5 

— 

3.5 

1 . — 

D 

VOL 

Output LOW Voltage 

Vcc - Min., Iol » 4mA 

— 

0.4 

— 

0.4 

a 

Vohc 

Output HIGH Voltage* 7 * 

Vcc - Min., Ioh - -4mA 

4.0 

— 

4.0 

— 

m 

Voht 

Output HIGH Voltage* 4 - 6 ) 

Vcc = Min., Ioh = -8mA 

2.4 

— 

<.2.4. "\j 

* — 

a 

Volt 

Output LOW Voltage* 4 - 6 ) 

Vcc - Min., Iol = 8mA 

— 

0.8 


0.8 

m 

Vih 

Input HIGH Voltage* 5 ) 


2.0 

— 4 


— 

m 

Vil 

Input LOW Voltage* 1 ) 


— 

0.8 

| J'— 

0.8 

m 

Vihs 

Input HIGH Voltage* 2 ' 5 ) 


3.0 

— 

^ 3.0 

— 

a 

Vils 

Input LOW Voltage* 1 ' 2 ) 


— 

;0.4 ^ 

— 

0.4 

V 

ClN 

Input Capacitance* 6 ) 


— 

1 10" 

— 

10 

pF 

Com 

Output Capacitance* 6 ) 


— J 

10 

— 

10 

pF 

Icc 

Operating Current 

Vcc - 5V, Ta - 70°C 


% ;v 825 

— 

850 


IlH 

Input HIGH Leakage* 3 ) 

Vih - VCC 

— 

100 

— 

100 

D9 

III 

Input LOW Leakage* 3 ) 

Vil . GND 

-100 

— 

-100 

— 


loz 

Output Tri-state Leakage 

Voh = VCC, VOL = GND 

-100 

100 

-100 

100 
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NOTES: 

1. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vus apply to Clk2xSys, CIk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag busses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 
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IDT79R3000A/AE RISC CPU PROCESSOR 


MIUTARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS* 1 - 2 - 3 )— 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ±5%) 


79R3000A 


16.67MHz 


Min. Max. 


Symbol 

Parameter 

| Clock 

TCkHigh 

Input Clock High* 2 ) 

TCkLow 

Input Clock Low* 2 ) 

TCkP 

Input Clock Period* 2 ) 

Clk2xSys to Clk2XSmp* 6 > 
Clk2xSmp to Clk2xRd* 6 ) 
Clk2xSmp to Clk2xPhi* 6 ) 


79R3000AE 


Run Operation 


TDEn Data Enable* 3 ) 


Tddis Data Disable* 3 ) 


TDVai Data Valid 


TWrOly Write Delay 


Data Set-up 


Data Hold* 3 ) 


CpBusy Set-up 


CpBusy Hold 


Access Type (1:0) 


Access Type (2) 


Memory Write 


Exception 


Address Valid 


lnt(n) Set-up 


lnt(n) Hold 


Stall Operation 


Reset Initialization 


Test Conditions 


20.0MHz 


Max. 




25.0MHz 


Max. 


33.33MHz 


Min. Max 



Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 



— 

8 

— 

8 

500 

tcyc/4 

tcyc/4 

tcyc/4 

20 

0 

0 

5 


-2 

— 

-1 

— 

3 

— 

4 

— 

_ 

6 

— 

i m 

— 

9 1 

— 


6 

— 

14 

— 

23 

— 

7 

— 

2 

_ 

— 

6 

— 

EE1 



TSAVal 

Address Valid 

Load= 25pF 

— 

30 

— 

23 


20 

— 

15 J 

TSAcTy 

Access Type 

Load= 25pF 

— 

mm 

— 


Kill 

18 

— 

IEE3I 

TMRdi 

Memory Read Initiate 

Load= 25pF 

1 

27 

1 

23 

i 

18 

1 

13.5 

TMRdt 

Memory Read Terminate 

Load= 25pF 

— 

27 

— 

23 

— 

18 

— 

10 

Tsti 

Run Terminate 

Load= 25pF 

3 

17 

wm 

15 

Dl 

10 

2 

mm\ 

TRun 

Run Initiate 

Load= 25pF 

— 

7 

— 

6 

— 4 

— 

3 

TSMWr 

Memory Write 

Load= 25pF 

3 

27 

3 

23 

3 

18 

2 

9.5 

TSExc 

Exception Valid 

Load= 25pF 

— 

15 

— 

mm 

— 

10 

— 

mm\ 


Trst. 

Reset Pulse Width 

TrstPLL 

Reset timing, Phase-lock on* 4 ' 5 ) 

Trstcp 

Reset timing, Phase-lock off* 4 ' 5 ) 

Capacitive Load Deration 

CLD 

Load Derate* 6 * j 


IBIElEnE9i££&3i 


ns/25pF 
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NOTES: 

1. All timings are referenced to 1 .5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters apply when the 79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 
for the longer of 3000 clock cydes or 200 microseconds. 

5. Tcyc is one CPU clock cy de (tw o cycles of a 2x clock). 

6. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns for 33.33MHz; clock transition time < 5ns for other speeds. 
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IDT79R3000A/AE RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS * 1 * 2 - 3 >— 

MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V ±10%) 


79R3000A 


Symbol 

Parameter 

| Clock 

TCkHigh 

Input Clock High* 2 ) 

TCkLow 

Input Clock Low* 2 ) 

TCkP 

Input Clock Period* 2 ) 

Clk2xSys to Clk2XSmp< 6 ) 
Clk2xSmp to Clk2xRd< 6 ) 
Clk2xSmp to Clk2xPhi* 6 ) 


79R3000AE 


Tost Conditions 


16.67MHz 


Min. Max. 


20.0MHz 


25.0MHz 


Max. 




Run Operation 


TDEn Data Enable* 3 ) 


Tddis Data Disable* 3 ) 


TDVal Data Valid 


TWrDiy Write Delay 


Data Set-up 


Data Hold* 3 ) 


CpBusy Set-up 


CpBusy Hold 


Access Type (1:0) 


Access Type (2) 


Memory Write 


Exception 


Address Valid 


lnt(n) Set-up 


lnt(n) Hold 


Stall Operation 



Load- 25pF 


Load- 25pF 


Load- 25pF 


Load- 25pF 


Load- 25pF 


TSAVal 

Address Valid 

Load- 25pF 

- 

30 

r=n 

23 « 

e 

20 

— 

15 

ns 

TSAcTy 

Access Type 

Load- 25pF 

■ 

Kl 

— 

IES 

ns 

TMRdi 

Memory Read Initiate 

Load- 25pF 

i 

2# 


23 

— 

18 

~ 

13.5 

ns 

TMRdt 

Memory Read Terminate 

Load- 25pF 

— 

27 

IWI 

Jl3 

— 

18 

— 

10 

ns 

Tstl 

Run Terminate 

Load- 25pF 

3 

17. 


f 15 

MM 

10 

2 

wrm 

ns 

TRun 

Run Initiate 

Load- 25 pF 


6 

— 

4 

— 

3 

ns 

TSMWr 

Memory Write 

Load- 25pF 

MB 

. Z7* 

wm 

23 


18 

2 

9.5 

ns 

TSExc 

Exception Valid 

Load- 25pF 

B' 

mm 


■El 

jBj 

mm 

— 

IQSI 

ns 


Reset Initialization 


Trst 

Reset Pulse Width 

TrstPLL 

Reset timing, Phase-lock on* 4 ’ 5 ) 

Trstcp 

Reset timing, Phase-lock off* 4 ’ 5 ) 


IBnESBESBini 


128 

- 

128 

- 

128 


Capacitive Load Deration 


CLD Load Derate* 6 ) 




ns/25pF 
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NOTES: 

1 . All timings are referenced to 1 .5V. 

2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters apply when the 79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 
for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cy cle (tw o cycles of a 2x clock). 

6. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 1 5%. 

7. Clock transition time < 2.5ns for 33.33MHz; clock transition time < 5ns for other speeds. 
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AC ELECTRICAL CHARACTERISTICS^ 1 - 2 ’ 3 )— 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V±5%) 


79R3000AE 


Symbol 


Clock 


TckHigh Input Clock High® 


TckLow Input Clock Low® 


TCkP 


Run Operation 


TDEn Data Enable* 3 ) 


Tddis Data Disable* 3 ) 


TDVal Data Valid 


TwrDiy Write Delay 


Data Set-up 


Data Hold* 3 ) 


CpBusy Set-up 


CpBusy Hold 


Access Type (1:0) 


Access Type (2) 


Memory Write 


Exception 


Address Valid 


lnt(n) Set-up 


lnt(n) Hold 


Stall Operation 


Reset Initialization 


Test Conditions 




Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


TSAVal 

Address Valid 

Load= 25pF 

TSAcTy 

Access Type 

Load= 25pF 

TMRdi 

Memory Read Initiate 

Load= 25pF 

TMRdt 

Memory Read Terminate 

Load= 25pF 

Tstl 

Run Terminate 

Load= 25pF 

TRun 

Run Initiate 

Load= 25pF 

TSMWr 

Memory Write 

Load= 25pF 

TSExc 

Exception Valid 

Load= 25pF 


Trst 

Reset Pulse Width 

TrstPLL 

Reset timing, Phase-lock on* 4 ' 5 ) 

Trstcp 

Reset timing, Phase-lock off* 4 ' 5 ) 

Capacitive Load Deration 

CLD 

Load Derate* 6 ) j 



12.5 

ns 

9 

ns 

9 

ns 

9 

ns 

6 

ns 

3 

ns 

9 

ns 

6 

ns 


NOTES: 

1 . All timings are referenced to 1 .5V. 

2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters apply when the 79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be i 
for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cy cle (tw o cycles of a 2x clock). 

6. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns. 


ns/25pF 
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Figure 13. Processor Reference Clock Timing 

* These signals are not actually output from the processor. 

They are drawn to provide a reference for other timing diagrams. 
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Figure IS. Memory Write Timing 
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Figure 16. Memory Read Timing 
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SysOut 


PhiOut 


Data Bus 


Run 

CpBusy 


Exception 

CpCond(n) 


Coprocessor Store 

Coprocessor Load 

1 

2 

1 
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Figure 18. 

Interrupt Timing 
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Figure 19. Mode Vector Initialization 


NOTES: 

1 . Reset must be neg ated s ynchronously; however, it should be asserted asynchronously. Designs must not rely on the proper functioning of SysOut prior 

to the assertion of R eset. 

2. If Phase-Lock On or R3000 Mode are asserted as mode select options, they should be asserted throughout the Reset period, to insure that the slowest 
coproc essor in the system has sufficient time to lock to the CPU clocks. 

3. Reset is actually sampled in both Phase 1 and Phase 2. To insure proper initialization, it must be negated relative to the end of Phase 1 . 
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ORDERING INFORMATION 


IDT 



Blank Commercial (0°C to +70°C) 

B Military (-55°C to + 1 25°C) 

Compliant to MIL-STD-883, Class B 


M 

Military Temperature Range Only 

GDI 75 

175-Pin PGA (Cavity Down) 

GD144 

F 

144-Pin PGA (Cavity Down) 
172-Pin Flat Pack (Cavity Down) 

PM 

1 60-Pin Plastic Quad Flat 

16 

16.67 MHz 

20 

20.0 MHz 

25 

25.0 MHz 

33 

33.33 MHz 

37 


40 


79R3000A 

RISC CPU Processor 

79R3000AE 

Enhanced Timing Version 
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VALID COMBINATIONS 

IDT 79R3000A- 16, 20 All packages 

79R3000A - 16, 20B, M GD144, GD 175, F 

79R3000AE - 25, 33 B, M GD144, GD 175, F 

79R3000AE - 25, 33, 37, 40 GDI 44, GD 1 75, F 
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IDT79R3001 



Integrated Device Technology, Inc. 


RISController™ 

CPU FOR HIGH-PERFORMANCE 
EMBEDDED SYSTEMS 


FEATURES: 

• Enhanced Instruction Set compatible version of 
IDT79R3000 RISC CPU 

• Achieves high-performance with reduced parts count 
and lower overall system cost 

• Flexible on-chip cache controller supports various 
cache, main memory sizes 

• Supports optional data parity with parity error output 
signal 

• Works with IDT79R3010 RISC Floating-Point 
Coprocessor 

• DMA interface support 

• Large synchronous memory space for real-time systems 

• Full 32-bit operations — 32-bit registers, 32-bit address 
and data interface 

• On-chip memory management unit with 64 fully associa- 
tive TLB entries maps 4 Gbyte virtual address space 

• High-speed interrupt response (6 interrupt input pins) 
with precise exception capability 

• High-speed CEMOS™ technology results in speeds 
from 12.5 to 40MHz 


• Supports caches from 8 Kbytes to 1 6Mbytes 

• Independent block refill sizes for the instruction and data 
caches 

• Concurrent cache refill and execution 

• Works on 8-, 1 6- and 32-bit data 

• Supports unaligned 32-bit data 

• Optimizing compilers for C, Ada, Pascal, Fortran 

• RTOS support for C or Ada environments 

DESCRIPTION: 

The IDT79R3001 brings the high-performance inherent in 
the IDT79R3000 RISC Microprocessorto lowercost systems. 
It does this while maintaining full (both User and Kernel) 
software compatibility with both the IDT79R2000A and 
IDT79R3000 RISC Microprocessors. 

The IDT79R3001 achieves lower system cost by reducing 
the number of components required to construct a synchro- 
nous memory (or cache) external to the processor and by 
simplifying the asynchronous memory interface. By removing 
the requirement for parity and allowing the system designer to 
select the cache organization which best suits the system, 


FUNCTIONAL BLOCK DIAGRAM 


CPO CONTROL CPU 



ADDRESS (24) 
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overall parts count is dramatically reduced while maintaining 
high performance. 

The IDT79R3001 RISC Microprocessor extends the ability 
of the IDT79R3000 family to support embedded and cost 
sensitive applications. Its level of integration and flexibility 
allows high-performance systems to be constructed at rea- 
sonable cost in a straightforward manner, without forcing the 
system designer to support features not required in his appli- 
cation. 

ThelDT79R3001 consistsoftwotightly coupled processors 
integrated on a single chip. The first processor is a full 32-bit 
CPU based on RISC principles to achieve a new standard of 
performance in microprocessor based systems. The second 
processor is a system control co-processor, called CPO, 
containing a fully associative 64-entry TLB (Translation 
Lookaside Buffer), MMU (Memory Management Unit), and 
control registers, supporting a 4 Gigabyte virtual memory 
subsystem and a Harvard Architecture Synchronous Memory/ 
Cache controller which achieves ultra-high bandwidth using 
industry standard SRAM devices. 

This data sheet provides an overview of the features and 
architecture of the IDT79R3001 CPU. A more detailed 
description of the operation and timing of this device is 
incorporated in the “IDT79R3001 Hardware User's Guide,” 
and a detailed architectural overview is provided in the “mips 
RISC Architecture" book, both available from IDT. Further 
literature describing the hardware, software, and develop- 
ment tools for the IDT79R3001 are also available from IDT. 

HARDWARE OVERVIEW 

The IDT79R3001 is a high-performance RISC micropro- 
cessor incorporating a fast execution engine and sophisti- 
cated yet flexible memory interface designed to support the 
processor bandwidth requirements at minimal system cost. 

Execution Engine 

ThelDT79R3001 containsthe same basic execution engine 
as the ultra-high performance I DT79R3000 and thus achieves 
over 28 MIPS performance at 33 MHz. 

Thekeytothe performance of the processor is the instruct ion 
pipeline, illustrated in Figure 2. The execution of a single 
IDT79R3001 instruction consists of five primary steps, some 
of which may be broken down further into smaller subsets. 

The five primary stages of the pipeline, each of which 
require approximately one CPU cycle, are: 

IF Instruction Fetch, when the processor fetches the 
instruction from the Instruction Synchronous 
Memory. 

RD Read required operands from on-chip register file 
while decoding the instruction. 

ALU Perform the required operation on instruction 
operands. 

MEM Access data memory (load or store). 

WB Write results back to register file. 


IF 

RD 

ALU 

MEM 

WB 

1-Memory 

Reg. 

File 

Operation 

D-Memory 

Write 

Back 

1 1 





One Cycle 


Figure 2. IDT79R3001 Five-Stage Pipeline 


Thus, the CPU achieves an average execution rate 
approaching one instruction per CPU cycle, since the execu- 
tion of five instructions at a time are overlapped within the 
processor (Figure 3). Optimizing compiler technology fully 
comprehends the interaction of software with the various 
pipeline resources, and serves to both eliminate any potential 
pipeline conflicts which might arise and to maximize instruc- 
tion throughput. 

The IDT79R3001 Memory Interfaces 

The key to achieving the inherent performance of the 
IDT79R3001 is to design a memory subsystem capable of 
providing a new instruction to the processor on almost every 
clock cycle. 

Like the IDT79R3000, the IDT79R3001 supports a hierar- 
chical view of the memory subsystem. However, the 
IDT79R3001 allows the system designerto make more trade- 
offs in the partitioning and architecture of the various levels in 
order to more completely meet the needs of certain types of 
applications. 

The IDT79R3001 supports two classifications of external 
memory: synchronous and asynchronous. The Harvard- 
Architecture (separate instruction and data memories) syn- 
chronous memory allows the processorto achieve the highest 
levels of performance. The processor is able to obtain both an 
instruction and data word from the synchronous memory on 
every clock cycle, resulting in high instruction and data 
throughput. 
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Figure 3. Instruction Execution in I DT79R3001 Pipeline 
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The asynchronous memory space contains larger, slower 
memory devices such as EPROM, main memory DRAMs, 
and peripheral devices. Multiple clock cycles are required for 
data movement in the asynchronous memory. 

Many systems implement a memory hierarchy between 
thesetwo memory spaces, whereby the synchronous memory 
space is used as processor caches and the asynchronous 
memory space is used for main memory. The IDT79R3001 
integrates a flexible Direct-Mapped Cache Controller On- 
Chip, eliminating external cache control logic and minimizing 
cache management overhead. If the synchronous memory 
space is used for processor caches, then cache “misses” will 
cause the processor to automatically process an asynchro- 
nous memory transfer to refill the cache. 

The key to achieving the system cost and performance 
goals of an IDT79R3001 -based system is to partition the 
memory system to the needs of the application. 


Synchronous Memory System 

As with any high-performance processor, the IDT79R3001 
requires high-bandwidth to achieve high-performance. Thus, 
it is important that the majority of its execution occur in the 
synchronous memory space. In applications which require 
substantial amounts of main memory, this memory space will 
be implemented as instruction and data caches. 

The synchronous memory is designed to be able to supply 
both an instruction and data word to the processor on each 
clock cycle. When the synchronous memory spaces are used 
as caches, then they are used to hold instruction and data that 
is repetitively accessed by the CPU (for example, within a 
program loop). This reduces the number of slower asynchro- 
nous memory cycles and thus achieves higher performance. 

Some microprocessors incorporate small amounts of cache 
on-chip, which has a very small and unpredictable effect on 
the execution of large programs. The IDT79R3001 supports 
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Figure 4. Synchronous Memory Control Timing 


caches of from 8kB in size up through 16MB, thus bringing 
substantial performance improvements to very large pro- 
grams and also allowing real-time system designers to design 
cache-based systems to support deterministic requirements. 

ThelDT79R3001 directly controls the synchronous memory 
interface (whether it is being used as caches or not) with a 
minimum of external components. The IDT79R3001 includes 
all control signals and cache TAG control logic (for a direct 
mapped cache) forthe synchronous memory interfaces. Parity 
over the data portion of each synchronous memory can be 
optionally selected at RESET time for applications which 
desire to make this cost trade-off. 


The synchronous interface works by dividing the basic 
CPU cycles into two phases. During one phase, a cache 
address is presented by the processor and captured by 
external latches (the latch control signals are directly gener- 
ated by the CPU). During the next phase, the address forthe 
other memory space is generated and captured while the data 
movement operation or the first cache is completed. The 
processor directly generates the SRAM Output Enable and 
Write Enable signals and the address latch enable signals, 
requiring no external decoding. This is illustrated in Figure 4. 

Further, the IDT79R3001 supportsthe ability to refill multiple 
words into the cache from main memory when a cache-miss 
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occurs, further reducing system cost and increasing perfor- 
mance in cache-based systems. ThelDT79R3001 canobtain 
1 , 4, 8, 1 6, or 32 words from main memory when processing 
a cache-miss, thus amortizing the cache-miss penalty over a 
large amount of data. 

ThelDT79R3001 also performs instruction streaming, which 
is the simultaneous execution of incoming instructions while 
the cache is being refilled. 

The actual width of the tag bus, and whether or not parity 
over the data parts of each synchronous memory is included, 
is determined according to how the device is initialized. The 
IDT79R3001 can accommodate a TAG bus width of 0-19 bits, 
compatible with a variety of cache sizes and cacheable main 


memory choices. The IDT79R3001 allows the system de- 
signer to scale the synchronous memory system exactly 
according to the system needs, thus eliminating extra memory 
and logic devices and achieving substantial cost savings with 
no loss of performance. 

Thus, the synchronous memory interface of the IDT79R3001 
allows for high-bandwidth memory systems to be implemented 
with a minimum of control logic. This is desirable, since RISC 
performance tends to be a function of memory bandwidth. By 
simplifying the design of the synchronous memory system 
(illustrated in Figure 5), it is easier for the system designer to 
achieve high performance with minimum chip count and 
without requiring ultra-fast or specialty components. 



Figures. IDT79R3001 Synchronous Interface 


The TAG Bus 

The TAG bus of the IDT79R3001 has been designed to 
allow the system designer to implement the exact cache 
configuration that is right for the system. For larger caches, 
low-order TAG bits do not need to be supplied for the TAG 
comparison. Additionally, the number of high-order TAG bits 
supplied is determined by the system designer, according to 
the amount of cacheable main memory the system supports. 
Since most embedded systems would tend to implement 
caches of 1 6KB and greater, and cacheable memory spaces 
of 32MB or smaller, significant cost and area reductions are 
achieved by configuring a smaller TAG bus. 


The system configures the on-chip TAG comparator at 
RESET Initialization time. If a TAG bit is not to be included in 
the synchronous memory TAG bit compare, a pull-down 
resistor of 4kft is connected to the appropriate IDT79R3001 
TAG pin. If a TAG bit is to be included, no resistor is required 
(the IDT79R3001 pulls floating inputs to Vcc during RESET 
by a small pull-up, which is disabled when RESET is negated). 

If a TAG bit is excluded from the cycle-by-cycle compari- 
son, it is still driven out with the appropriate address value 
during write cycles or asynchronous memory reads. Thus, the 
system designer still has the full 4 Gbyte of address space 
available for address decoding, without requiring the syn- 
chronous memory to be able to cache all such addresses. 
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Figure 6 illustrates a reduced system, which implements 
16KB of Instruction and 16KB of data cache, and 512MB of 
cacheable address space, using just 6 IDT71586 4Kx16 
Latched Cache RAM™ components and4pull-downresistors. 

Note that in systems which do not implement the syn- 
chronous memory space as cache, then pull-down resistors 


would be added to all TAG pins. The Valid Pin still needs to 
be supplied on each cycle, thus allowing various memory 
schemes to be implemented (such as static column DRAM). 
However, the IDT79R3001 can be initialized to not assert the 
Valid pin as an output during Write cycles, simplifying the 
design of logic to drive the signal. 


IDT79R3001 

RISController 




Figure 6. Small Footprint Cache for IDT79R3001 


Cache Update 

When the on-chip TAG comparator indicates that the item 
read from the cache was not the desired item, a cache-miss 
is processed. A main memory (asynchronous) transfer is 
automatically processed. 

The IDT79R3001 desires to update the cache using a burst 
refill of multiple adjacent words from main memory. The 
processor is “stalled” until the first word of the block is 
available. The processor is then released, and the block of 
words is brought into the cache at the rate of one word per 
CPU clock cycle. 

Note that if the cache-miss was in the instruction cache , the 
processor is capable of simultaneously executing the incoming 
instruction stream as the cache is updated, thus effectively 
making the cache update transparent to the system and 
increasing performance. 

Write Cycles 

The 1DT79R3001 utilizes a write through cache. That is, 
data written by the processor is both written to the cache and 


main memory simultaneously. Thus, main memory always 
has a current copy of all data. 

Typically, latching devices are used between the cache 
subsystem and the slower main memory. These Write Buffers 
capture the data simultaneous with the cache update, allowing 
the processor to continue to the next cycle without actually 
waiting for the main memory transfer to complete. The 
IDT79R3001 generates parity over the data field on write 
cycles, which can be propagated into both the synchronous 
and asynchronous memory spaces. 

When the processor writes less than a 32-bit quantity (a 
“partial” word), the processor can perform a “read-modify- 
write” of the cache. That is, the processor will read the 32-bit 
word containing the partial address(es) to be updated from the 
cache. If a “hit” occurs, then the new data will be merged with 
the old and the new 32-bit value will be written both to the 
cache and to main memory. If a cache “miss” occurs, then only 
the partial data is written to main memory and the cache is 
unchanged. Partial word capability is selected as a RESET 
option. 
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THE ASYNCHRONOUS MEMORY INTERFACE 

The IDT79R3001 also supports an asynchronous memory 
interface, which supports the use of slower memory devices 
such as slow DRAM or EPROM and also supports the use of 
peripherals and other “non-cacheable” devices. 

In general, if a cache-miss (or parity error, if enabled) 
occurs, the processor will automatically use the asynchro- 
nous memory interface to retrieve the desired data, and will 
update the cache accordingly. 

Additionally, software can force the use of the asynchronous 
memory space through the use of the on-chip MMU. Whenthe 
processor seeks either instructions or data within a certain 
address range (ksegl ), the processor knows that this data is 
uncacheable and will perform an asynchronous memory 
transfer. Additionally, within cacheable memory, TLB entries 
can be used to make certain pages as “uncacheable”. When 
an address of an “uncacheable” page is used, the processor 
will automatically use the asynchronous memory space. 

The asynchronous memory space uses the same data bus 
as the synchronous memory space. This facilitates the 
automatic updating of cache memory when the asynchronous 
memory is accessed due to cache-miss activity or memory 
writes. The asynchronous address bus is composed from the 
synchronous memory AddrLo bus, and the TAG bus. External 
logic devices (such as IDT74FCT374A registers) are used to 
capture AddrLo and T AG values for the asynchronous transfer 


address. Note that systems which exclude individual TAG bits 
from comparison (to reduce cache width) still have all TAGs 
available as outputs. 

The data path between the processor and the asynchro- 
nous memory space is managed according to the needs of the 
application. Write Buffer FIFO devices, such as the 
IDT79R3020, are used to capture address and data during 
store cycles. These devices are used to capture the data in 
one-cycle, and allow the processor to continue to execute 
from the synchronous memory while the slower asynchro- 
nous memory actual retires the write. 

The read path is also constructed according to the needs of 
the system. If block refill is used, then the read path is highly 
dependent onthe designof the main memory system. Pipeline 
devices such as IDT74FCT540A, or simple latches such as 
IDT74FCT374, may be used. 

A simple asynchronous memory interface is shown in 
Figure 7. In this system, main memory is assumed to be fast 
enough to support the block refill requirements of the system, 
thus simplifying the read path. In fact, both the read and write 
data paths are actually managed through a single set of 
IDT29FCT52A bidirectional latching transceivers. 

During write cycles (which are typica lly captured by Write 
Buffers) , the processor asserts MemWr to indicate that a write 
cycle is in progress. The memory system negates WrBusy to 
indicate that the processor is done with the write cycle. 
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Figure 7. IDT79R3001 Asynchronous interface 


5.2 


6 











IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 



Figure 8. IDT79R3001 Interface to IDT79R3010 Floating Point Co-Processor 


During read cycles, the processor will assert MemRd to 
indicate that a main memory read is in progress. The memory 
system will hold RdBusy active until t he d esired data is 
available. The processor will activate the XEn signal to allow 
data to be passed from the main memory to the processor 
databus. If the cache is to be updated with the new data, then 
the processor will assert the appropriate cache write signal to 
allow the cache RAMs to capture the incoming databus. 

The AccTyp bus is used to indicate the size of the data 
transfer (8, 16, 24, or 32 bits), and for main memory reads, 
whether or not the data is “cacheable”. This simplifies the 
main memory address decoding, since the AccTyp indicates 
whether the main memory needs to perform a burst read of 
multiple words. 

Co-Processor Interface 

The IDT79R3001 implements a co-processor interface, 
which allows the use of the IDT79R3010 high-performance 
RISC Floating Point Accelerator without requiring the use of 
external interface components. 

The co-processor interface has been designed to make 
system co-processors appear to the programmer as if they 
were on-chip extensions of the core execution engine. Thus, 
the IDT79R3010 FPA works as a true co-processor, rather 
than as a peripheral which must be programmed. 

In the IDT79R3001 co-processor model, the CPU is 
responsible for controlling all data cycles. The co-processor 
keeps in synchronization with the CPU (including the pipeline 
stages), and uses a Phase-Locked Loop to keep synchronized 



withtheprocessorbustraffic. The co-processorthen “snoops” 
the data bus, watching for co-processor instructions. It also 
knows when data cycles on the bus are intended for it (either 
as a target in co-processor load operations, or as a source for 
co-processor restore operations), and performs the data 
portionoftheoperationwhen appropriate. Thus, co-processors 
effectively load and store directly with memory, without requiring 
operands to go through the CPU first. This achieves the 
highest levels of performance (note that the co-processor 
interface also supports move, whereby data can be moved 
directly between the CPU and any co-processor). 

Figure 8 illustrates the use of the IDT79R3010 in a 
IDT79R3001 system. The co-processor interface manages 
synchronization between the parts, and is used to communi- 
cate status from the co-processorto the CPU. CpBusy, or co- 
processor busy, stalls the CPU until the busy co-processor 
resource (requested by a co-processor instruction) is free, 
and CpCond, or co-processor condition, is used to report 
status on co-processor test instructions. CpSync, is used to 
help the co-processor stay “locked” to the CPU , so that the co- 
processor knows when data is on the bus to be sampled on 
load operations or when to place data on the bus for store 
operations. 

Note that the co-processor sits on the same data bus as the 
CPU, but has no connection to the address bus. The CPU is 
responsible for performing all memory addressing, including 
the determination of “cache hit”, write-buffer full cycles, and 
any processing that might be required for cache misses. 
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INTERRUPTS 

The IDT79R3001 features 6 separate interrupt input pins. 
Interrupts are not vectored, but rather cause the general 
exception vector address to be the next execution address. 

These pins are not encoded internally; external logic can 
choose to implement these interrupt lines as either 6 or 64 
interrupt sources; software would then perform the appropri- 
ate decoding to get to the specific interrupt handler. 

Interrupts are recognized in the ALU stage of the on-chip 
pipeline. Instructions less advanced in the pipeline are 
‘Hushed” and will be restarted when the return from exception 
occurs (an on-chip register contains the address of the 
instructionwhich was excepted). Instructions further advanced 
in the pipeline are allowed to continue. Unlike other RISC 
processors, the IDT79R3001 does not require the program- 
mer to save and restore pipeline status to allow normal 
execution to be resumed. Depending on the application and 
exception, at most software would need to save/restore the 
on-chip data registers, status register, Exception PC and 
exception “cause” register. 

Note that the co-processor model includes “precise ex- 
ceptions.” That is, an exception is signaled to the exact 
instruction which generated the exceptional condition. No 
further state commitments are made by the IDT79R3001 and, 
thus, the exact context at the time of the exception is known 
to the programmer. This is true even for multi-cycle opera- 
tions, such as those of the FPA. 
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DMA INTERFACE 

The IDT79R3001 features a simple DMA interface which 
allows an external master to gain control of the synchronous 
memory space. Note that it is not necessary to include logic 
on the CPU to arbitrate for the asynchronous memory space; 
the read/write buffer interface is where such arbitration logic 
belongs and it is left to the system designer to implement the 
type of asynchronous memory structure that best fits the 
application. 

When an external master “owns" the synchronous bus, the 
CPU will tri-state the following pins and buses: 

AddrLo: The Synchronous memory direct address bus. 

Data & Tag: The synchronous memory RAM data lines. 

Cache Control: iRd, TWr, ICIk, DRd, DWr and DCIk. This 
allows the external masterto use the existing control 
lines to control the synchronous memory. 

XEn: The read buffer transceiver enable, which will 
allow the external master to use the read/write 
buffer path for DMA. 

Valid: This enables the DMA interface to be used for 
multi-processing applications. 
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Figure 9. IDT79R3001 DMA Interface 
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Input 

W Cycle 

X Cycle 

Y Cycle 

Z Cycle 

Into 

Reserved 

Reserved 

Reserved 

Reserved 

TntT 

Reserved 

Reserved 

Reserved 

Reserved 

Int2 

DBIkSizeO 

DBIkSizel 

Parity On 

Valid Output 

Int3 

IBIkSizeO 

IBIkSizel 

StorePartial 

ControlLow 

Int4 

PliOn 

PHOn 

PliOn 

PliOn 

Irrt5 

Reserved 

BigEndian 

TriState 

Reserved 


NOTE: 2873 tbl 01 

1. Reserved signals must be "high” during these cycles. 


Table 1. IDT79R3001 Mode Selectable Features 


The DMA interface consists of a single input signal, 
DMAStall, which causes the processor to stall and to tri-state 
the above named lines. The external master is guaranteed 
mastership of the bus within a very short number of cycles, 
depending on the exact external bus activity of the CPU when 
the DMA was requested. The DMA master negates the 
DMAStall signal when the DMA operation is completed to 
allowtheCPUto resume processing. Consultthe“IDT79R3001 
Hardware User’s Guide” for more details. 

Figure 9 illustrates the system connection of an external 
DMA master to a IDT79R3001 system. 

ADVANCED FEATURES 

The IDT79R3001 contains special features which provide 
added flexibility across a number of applications, as well as 
allow for system diagnostic support. 

In support of diagnostics, the I DT79R3001 allows for cache 
"swapping" (interchange of which memory bank is for instruc- 
tion and which isfordata), which is useful in system initialization, 
cache flushing, and diagnostics. Additionally, the caches can 
be “isolated" from main memory, which forces cache “hits” to 
occur regardless of the tag comparison, and which is useful in 
determining that the synchronous memory space RAMs are 
functional. 

An additional feature is the ability to enable parity checking 
over the data field of each synchronous memory. If parity is 


General Purpose Registers Multiply/Divide Registers 



Figure 10. IDT79R3001 Registers 


enabled, the processor will check the parity when a synchro- 
nous access occurs; if a pari ty error is de tected, it is signaled 
to the external world on the Parity Error signal and a cache- 
miss cycle is processed, the Parity Error signal will remain low 
until the parity error flag in the CPO status register is cleared 
by software. 

A number of other system selectable features are selected 
at reset time. The input reset “vectors” are sampled on the 
interrupt input lines during the last four cycles of the reset 
period. The input vectors are listed in Table 1. These 
selections include the ability to select the block refill sizes for 
each of the instruction and data memories, whether Big 
Endian or Little Endian order is to be used, whetherto use data 
parity, and whether or not to accommodate a Phase-Locked 
Loop for a co-processor. The initialization of the CPU and 
meaning of each input vector is more fully explained in the 
“IDT79R3001 Hardware User’s Guide”. 

PROCESSOR ARCHITECTURE 

The IDT79R3001 is a full implementation of the 
IDT79R2000A/IDT79R3000 Instruction Set Architecture (the 
MIPS-I ISA). This architecture is discussed in great detail in 
“mips RISC Architecture,” available from IDT. 

IDT79R3001 CPU Registers 

The IDT79R3001 CPU provides 32 general purpose 
(orthogonal) 32-bit registers, a 32-bit Program Counter and 
two 32-bit registers used to hold the results of the CPU integer 
multiply and divide operations. 

Two of the 32 general registers have special purposes 
designed to increase processor performance: register rO is 
hardwired to the value “0”, a useful constant; and register r31 
is used as the link register in jump-and-link instructions (the 
return address for subroutine calls). Otherwise, there is no 
requirement that a particular register be used as a stack or 
frame pointer, etc., although there is a register convention as 
part of the “mips ABI” (Applications Binary Interface standard) 
which the compiler suite uses. 

The CPU registers are illustrated in Figure 10. Note that 
there is no Program Status Word register shown in this figure. 
The functions traditionally provided by a PSW register are 
instead provided in the Status and Cause Registers incorpo- 
rated within the on-chip System Control Co-Processor (CPO). 
The instruction set does not use condition codes. 
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Instruction Set Overview 

All IDT79R3001 instructions are 32 bits long and there are 
only three instruction formats (see Figure 11). This approach 
simplifies decoding, thus minimizing instruction execution 
time. The IDT79R3001 processor initiates a new instruction 
on every RUN cycle, and is able to complete an instruction on 
almost every clock cycle. The only exceptions are the LOAD 
instructions and BRANCH instructions, which each have a 
single cycle of latency associated with their execution (that is, 
the instruction immediately afterthe branch is always executed 
regardless of the branch condition; similarly, the data loaded 
by a LOAD instruction is not available to the subsequent 
instruction). However, in the majority of cases the compilers 
(and even the MIPS assembler) are able to reorder instruc- 
tions to fill these latency cycles with useful instructions which 
do not require the results of the previous instruction (in the 
worst case, a NOP instruction is inserted). This effectively 
eliminates these latency effects and does not require the 
applications programmerto be aware of the pipeline structure. 

The actual instruction set of the CPU was determined after 
extensive simulations to determine which instructions should 
be implemented in hardware and which operations are best 
synthesized in software from other basic operations. This 
methodology has resulted in the highest performance 
processor available. 

The IDT79R3001 instruction set can be divided into the 
following groups: 

• Load/Store Instructions move data between memory 
and the general registers. These are all “1-Type” instruc- 
tions. The only addressing mode supported is base regis- 
ter plus signed, immediate 16-bit offset. This effectively 
allows three addressing modes: register plus offset, reg- 
ister (using zero offset), and immediate (using rO,the zero 
register). 

The Load instruction has a single cycle of latency, as 
described above. That is, the instruction immediately after 
the load instruction cannot rely on the new data; however, 
the assembler and compilers automatically handle this, 
reordering code to insure that no conflicts occur. Note that 
the store operation has no latency in its effect. 

Loads and stores can be performed on byte, half-word, 
word, or unaligned word data (32-bit data not aligned on a 
modulo-4 address). 

• Computational instructions perform arithmetic, logical, and 
shift operations on values in registers. They occur in both 
“R-Type” (both operands and the result are general regis- 
ters), and “1-Type” (one operand is a 16-bit immediate 
value) formats. 

Note that computational instructions are three operand 
instructions: that is, the result register can be different from 
both source registers. This means that operands need not 
be overwritten by arithmetic operations. This results in a 
more efficient use of the register set, and further increases 
performance. 


I-Type (Immediate) 



Figure 11. IDT79R3001 Instruction Formats 


• Jump and Branch instructions change the flow of control of 
a program. Jumps are always to a paged absolute address 
formed by combining a 26-bit target with four bits of the 
Program Counter (“J-Type” format for subroutine calls), or 
32-bit register byte addresses ("R-Type,” for Returns and 
dispatches). Branches have 16-bit offsets relative to the 
program counter (“1-Type”). 

Jump and Link instructions save a return address in 
Register 31. The IDT79R3001 instruction set features 
numerous branch conditions. Included is the ability to 
branch based on a comparison of two registers, or on the 
comparison of a registerto zero. Thus, net performance is 
increased since the processor does not have to precede 
the branch instruction with arithmetic operations. 

• Co-processor instructions perform operations in the co- 
processors (such as the IDT79R3010FPA). Co-processor 
Loads and Stores are “1-Type;” computational instructions 
have co-processor dependent formats. 

• Co-processor 0 instructions perform operations on the 
System Control Co-processor (CPO) registers to manipu- 
late the memory management and exception handling 
facilities of the on-chip co-processor. 

• Special instructions perform a variety of tasks, including 
movement of data between general and special registers, 
system calls, and breakpoint operations. These are always 
“R-Type.” 

IDT79R3001 System Control Co-processor (CPO) 

The IDT79R3001 canoperatewithuptofourtightly coupled 
co-processors, designated CP0-CP3. CPO is included on- 
chip as co-processor 0, the System Control co-processor. 
CPO is responsible for supporting both the virtual memory 
system and the exception handling functions of the 
IDT79R3001 . 
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OP 

Description 

OP 

Description 


Load/Store Instructions 


Multiply/Divide Instructions 

LB 

Load Byte 

MULT 

Multiply 

LBU 

Load Byte Unsigned 

MULTU 

Multiply Unsigned 

LH 

Load Halfword 

DIV 

Divide 

LHU 

Load Halfword Unsigned 

DIVU 

Divide Unsigned 

LW 

Load Word 

MFHI 

Move From HI 

LWL 

Load Word Left 

MTHI 

Move To Hi 

LWR 

Load Word Right 

MFLO 

Move From LO 

SB 

Store Byte 

MTLO 

Move To LO 

SH 

Store Halfword 



SW 

Store Word 


Jump and Branch Instructions 

SWL 

Store Word Left 

J 

Jump 

SWR 

Store Word Right 

JAL 

Jump and Link 



JR 

Jump to Register 


Arithmetic Instructions 

JALR 

Jump and Link Register 


(ALU Immediate) 

BEQ 

Branch on Equal 

ADDI 

Add Immediate 

BNE 

Branch on Not Equal 

ADDIU 

Add Immediate Unsigned 

BLEZ 

Branch on Less than or Equal to Zero 

SLTI 

Set on Less Than Immediate 

BGTZ 

Branch on Greater Than Zero 

SLTIU 

Set on Less Than Immediate 

BLTZ 

Branch on Less Than Zero 


Unsigned 

BGEZ 

Branch on Greater than or 

ANDI 

AND Immediate 


Equal to Zero 

ORI 

OR Immediate 

BLTZAL 

Branch on Less Than Zero and Link 

XORI 

Exclusive OR Immediate 

BGEZAL 

Branch on Greater than or Equal to 

LUI 

Load Upper Immediate 


Zero and Link 




Special Instructions 


Arithmetic Instructions 

SYSCALL 

System Call 


(3-operand, register-type) 

BREAK 

Break 

ADD 

Add 



ADDU 

Add Unsigned 


Coprocessor Instructions 

SUB 

Subtract 

LWCz 

Load Word from Coprocessor 

SUBU 

Subtract Unsigned 

SWCz 

Store Word to Coprocessor 

SLT 

Set on Less Than 

MTCz 

Move To Coprocessor 



MFCz 

Move From Coprocessor 

SLTU 

Set on Less Than Unsigned 

CTCz 

Move Control to Coprocessor 

AND 

AND 

CFCz 

Move Control From Coprocessor 

OR 

OR 

COPz 

Coprocessor Operation 

XOR 

Exclusive OR 

BCzT 

Branch on Coprocessor z True 

NOR 

NOR 

BCzF 

Branch on Coprocessor z False 


Shift Instructions 


System Control Coprocessor 

SLL 

Shift Left Logical 


(CPO) Instructions 

SRL 

Shift Right Logical 

MTCO 

Move To CPO 

SRA 

Shift Right Arithmetic 

MFCO 

Move From CPO 

SLLV 

Shift Left Logical Variable 

TLBR 

Read indexed TLB entry 

SRLV 

Shift Right Logical Variable 

TLBWI 

Write Indexed TLB entry 

SRAV 

Shift Right Arithmetic Variable 

TLBWR 

Write Random TLB entry 



TLBP 

Probe TLB for matching entry 



RFE 

Restore From Exception 
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Table 2. IDT79R3001 Instruction Summary 
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CPO Registers 

As a co-processor, CPO has a number of registers which it 
uses to perform its control functions. These include 64 fully 
associative Translation Lookaside Buffers (TLBs), used to 
manage the virtual memory space; registers to manage the 
TLB set; and the exception handling registers. Figure 12 
illustratesthe register set of the System Control Co-processor. 
Table 3 provides a brief explanation of the function of each of 
these registers. A more detailed explanation of the use of 
each of these registers is included in the “mips RISC 
Architecture” manual. 

Memory Management System 

The IDT79R3001 supports a virtual memory system, so 
that each task in a given application can be unaware of the 
addressing needs of other tasks. This is also useful in 
systems with limited physical memory; the IDT79R3001 pro- 
vides for the logical expansion of memory by translating 
addresses composed in a large virtual space into available 
physical memory addresses. 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 



| | Used with Virtual Memory System 

| | Used with Exception Processing 
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Figure 12. The System Control Co-processor (CPO) Registers 


Register 

Description 

EntryHi 

High half of a TLB entry 

EntryLo 

Low half of a TLB entry 

Index 

Programmable pointer into TLB array 

Random 

Pseudo-random pointer into TLB array 

Status 

Mode, interrupt enables and diagnostic status 
information 

Cause 

Indicates nature of last exception 

EPC 

Exception Program Counter — contains address of 
instruction which detected the exception 

Context 

Pointer into kernel’s virtual Page Table Entry array 

BadVA 

Most recent bad virtual address 

PrID 

Processor revision identification (Read only) 


2873 tbl 03 


Table 3. CPO Registers 
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MMU ADDRESS TRANSLATION 

VIRTUAL PHYSICAL 


Oxffffffff 


OxcOOOOOOO 


OxaOOOOOOO 


0x80000000 


0x00000000 


Kernel Mapped 
Cacheable 
(kseg2) 

ANY^> 


Kernel Uncached 



(ksegl) 



Physical 

Memory 

Kernel Cached 



(ksegO) 




User Mapped 
Cacheable 

ANY^> 



(kuseg) 

—+> 

Memory 


y 3584 MB 




f 


512 MB 
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IDT79R3001 Operating Modes 

The IDT79R3001 has two operating modes: User Mode 
and Kernel Mode. The IDT79R3001 normally operates in the 
User Mode until an exception is detected, forcing it into the 
Kernel Mode. The processor remains in Kernel Mode until the 
exceptions are handled and the processor executes an RFE 
(Return from Exception) instruction, which will restore it to 
User Mode. Kernel Mode allows software to alter machine 
state information such as that contained in the CPO registers; 
that is, if in User Mode an access is attempted to Co-processor 
0 and the Kernel has not enabled the User to access the co- 
processor, an exception will occur. Similarly, if a User task 
attempts to use a Kernel virtual address, an exception will 
occur. Thus, system resources are protected from Usertasks. 

The manner in which memory addresses are translated 
(mapped) depends on the operating mode of the IDT79 R3001 
and on the virtual address desired. Figure 13 illustrates the 
virtual address mapping performed by the IDT79R3001 : 

User Mode — inthis mode, a single, uniform virtual address 
space (kuseg) of 2 Gbyte is available to each user task (tasks 
are further identified by a 6-bit process identifier field in order 
to form unique virtual addresses). All references to this 


segment are mapped using the TLB, which utilizes both the 
virtual address and the Process I D field to perform the virtual- 
to-physical mapping (note that this allows the cache to be 
shared by up to 64 User processes at a time without requiring 
time consuming Cache or TLB flushing). 

Kernel Mode — Four separate segments are accessible 
through this mode: 

• kuseg — When in the Kernel Mode, references to this 
segment are treated just like User Mode references, thus 
streamlining Kernel accesses to User memory. 

• ksegO — References to this 51 2 Mbyte segment may use 
the cache memory, but are not translated by the TLB. 
Instead, these addresses map directly to the first 512 
Mbytes of the physical address space. Note that many 
dedicated embedded applications will utilize this address 
space and kseglonly, rather than any of the TLB mapped 
segments. 

• ksegl— References to this 512Mbyte segment are not 
mapped through the TLB. Additionally, this memory is 
viewed as uncacheable, which means that references 
through this segment will always use the asynchronous 
memory interface. As with ksegO, references through this 



63 

44 
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31 
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11 

10 

9 
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0 

VPN 

TLBPID 

O 

PFN 

N 

D 

V 

G 

0 

v 


Y 

A 




^ 







J 


ENTRYHI ENTRYLO 


VPN - Virtual Page Number 
TLBPID - Process ID 
PFN - Physical Frame Number 
N - Non-cacheable Physical Page 


D - Dirty PageA/Vrite Protect 
V - Valid entry flag 
G - Global flag (ignore PID) 

0 - Reserved 

2873 drw 14 


Figure 14. TLB Entry Format 
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segment are hard-mapped to the first 512 Mbytes of 
physical memory. When the processor boots, the reset 
vector is contained in this segment, so that the processor 
does not require either the cache or the TLB to be valid at 
RESET time. 

• kseg2— References to this 1 Gbyte segment are always 
mapped through the TLB. As with kuseg, the ability of 
memory pages to be cached is determined by a bit setting 
in the TLB entry for that page. 

The Translation Lookaside Buffer (TLB) 

The translationof virtual addresses in eitherkusegorkseg2 
(mapped segments) is performed by the on-chip Translation 
Lookaside Buffer array. This array consists of 64 fully- 
associative (content addressable) memory elements. Each 
entry maps a 4Kbyte virtual page to a 4Kbyte physical page. 
Each TLB entry contains other information about the virtual 
address it maps (such as which User process it maps) and 
also aboutthe physical address (such aswhetherit is cacheable 
orwriteable). 

Figure 14 illustrates the format of each TLB entry. The 
translation operation is illustrated in Figure 15. The upper 
portion of the desired virtual address is compared against the 
VPN field of each TLB entry. Additionally, the current process 
ID (contained in the TLBHI register) is matched against the 
PID fieldof the TLB entry (if the TLB entry is marked as Global, 
the PID comparison is ignored). If a match occurs, and the 
TLB entry is marked as Valid, then the translation is completed 
by replacing the VPN of the virtual address with the corre- 
sponding PFN (Physical Frame Number). 


Note that the use of the TLB does not incur an execution 
penalty, since the execution engine pipeline includes stages 
to cover for the time required to make the TLB search and 
translation. 

TLB misses occur when no successful match occurs. 
These events are handled in software. The CPO registers give 
the software enough information to obtain the appropriate TLB 
entry at speeds which exceed those achieved by many CPUs 
which use hardware TLB replacement (10-12 cycles under 
UNIX). 

When a TLB miss occurs, the address of the instruction 
which was executing is stored in the EPC register, and the 
BadVA register contains the address which was being trans- 
lated. The Context register uses the BadVA value to generate 
a direct pointer to the kernel Page T able Entry for the desired 
virtual address. The Random register suggests the TLB entry 
to be replaced by the new entry. Note that the lower eight TLB 
entries are not pointed to by Random; the kernel software can 
thus insure that it is constantly mapped, and deterministic 
response is guaranteed. 

BACKWARD COMPATIBILITY WITH 
IDT79R2000A AND 79R3000 PROCESSORS 

The IDT79R3001 can execute the same binary software 
(either kernel or user) that is executed by either the 
IDT79R2000A or IDT79R3000. At the system level, some 
hardware re-design is necessary to achieve the cost savings 
inherent in the IDT79R3001 hardware interface. 
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Figure 15. Virtual to Physical TLB Translation 
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PIN DESCRIPTIONS 


Pin Name 

I/O 

Description 

| Memory Interface 

Data (0:31) 

I/O 

A 32-bit bus used for all instruction and data transmission among the processor, synchronous memory space, 
asynchronous memory space and co-processors. 

DataP (0:3) 

I/O 

A 4-bit bus containing even parity over the data bus. If parity checking is enabled, a parity error will cause the PErr 
signal to be asserted and a cache-miss to occur. Regardless of whether parity checking is enabled, the processor 
will always generate parity on writes. 

Tag (13:31) 

I/O 

A 19-bit bus used for transferring cache tags and high-order address bits between the processor, caches and 
asynchronous memory spaces. 

AddrLo (0:23) 

o 

A 24-bit bus containing low-order byte addresses for both the synchronous (cache) and asynchronous memory 
spaces. 

| Synchronous Memory Control 

IRd 

o 

The output enable for the instruction cache. The polarity of this signal is selectable. 

IWr 

o 

The write enable for the instruction cache. The polarity of this signal is selectable. 

ICIk 

o 

The instruction cache address latch clock. The clock runs continuously. 

DRd 

0 

The output enable for the data cache. The polarity of this signal is selectable. 

DWr 

o 

The write enable for the data cache. The polarity of this signal is selectable. 

DCIk 

o 

The data cache address latch clock. The clock runs continuously. 

Valid 

I/O 

A high on this signal indicates that the Tags just read from the cache are valid. When a cache update occurs, the 
processor will generate the appropriate Valid bit. 

PErr 

0 

If parity checking is enabled, this signal is an active low output of the internal CPO parity error status bit. it is driven 
low when a parity error is detected and remains low until software clears the parity error flag in the status register. This 
pin is physically the same pin as AccTyp2. Its function is selected during device reset. 


Asynchronous Memory Interface 


XEn 


AccTyp (0:2) 


MemWr 


MemRd 


BusError 


Run 



The transceiver enable for the read buffer. 


A 3-bit bus used to indicate the size of data being transferred on the asynchronous memory bus, whether omot a data 
transfer is occurring and the purpose of the transfer. If parity checking is enabled, AccTyp2 becomes the PErr signal. 


Signals the occurrence of an asynchronous memory write cycle. 


Signals the occurrence of an asynchronous memory read cycle. 


Signals the occurrence of a bus error during an asynchronous memory transfer cycle. 


Indicates whether the processor is in a RUN or STALL state. 


Indicates the instruction about to commit processor state should be aborted and other exception related information. 


A cbck derived from the internal processor clock used to generate the system clock. 


The asynchronous memory read stall termination signal. In most system designs, RdBusy is normally asserted and 
is deasserted only to indicate the successful completion of the memory read. RdBusy is sampled by the processor 
only during memory read stalls. 


The asynchronous memory write stall initiation/termination signal. WrBusy is only sampled during write operation. 


Co-Processor Interface 

CpSync 

O 

A clock which is identical to SysOut and used by co-processors for timing synchronization with the CPU. 

CPBusy 

1 

The co-processor busy stall initiation/termination signal. 

CpCond (0:3) 

■ 

A4-bit bus used to transferconditional branch status from the co-processors to the CPU. CpCond(O) is used to control 
whether or not a cache burst refill occurs; the other signals are used as input port pins for co-processor branch 
instructions. 

Processor Control Signals 



Clk2xSys 


Clk2xSmp/Rd 


Clk2xPhi 


Reset 


DMA Stall. Signals to the processor that it should stall accesses to the synchronous memories and tri-state the 
synchronous memory interface. 


A 6-bit bus used to signal maskable interrupts to the CPU. A reset time, mode values are sampled from this bus to 
initialize the processor. During normal operation, these signals are not latched by the processor and must remain 
asserted until the processor acknowledges the interrupt (through software) to the interrupt source. 


The master double frequency input clock, used to generate SysOut. 


A double frequency clock input used to determine the sample point for data coming into the CPU and co-processors 
and used to determine the enable time of the synchronous memory RAMs. 


A double frequency clock input used to determine the position of the two internal phases. 


Initialization input used to force execution starting from the reset memory ad dress. R eset should be asserted 
asynchronously but must be negated synchronously with the leading edge of SysOut. 
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IDT79R3001 

RISControlIer FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


ABSOLUTE MAXIMUM RATINGS 0, 3) 


Symbol 

Rating 

Commercial 

Military 

Unit! 

Vterm 

Terminal Voltage 
with Respect 
toGND 

-0.5 to +7.0 

-0.5 to +7.0 


Ta.Tc 

Operating 

Temperature 

0 to +70C> 
(Ambient) 

0 to +90< 5 > 
(Case) 

-55 to +125 
(Case) 

1 

Tbias 

Case Temperature 
Under Bias 

-55 to+125«> 
0 to +90< 5 > 

—65 to +135 

H 

Tstg 

Storage 

Temperature 

-55 to +125 

-65 to +155 

°C 

IlN 

Input Voltage 

—0.5 to +7.0 

-0.5 to +7.0 

V 


NOTE: 2873 tb) 05 


1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. Vin minimum = -3.0V for pulse width less than 15ns. 

Vin should not exceed Vcc +0.5 Volts. 

3. Not more than one output should be shorted at a time. Duration of the short 
should not exceed 30 seconds. 

4. 16-33 MHz only. 

5. 37-40 MHz only. 


AC TEST CONDITIONS 


RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


Grade 

Temperature 

GND 

Vcc 

Military 

16-33 MHZ 

-55°C to +125°C 
(Case) 

0 V 

5.0 ±10% 

Commercial 
16-33 MHz 

0°C to +70°C 
(Ambient) 

ov 

5.0 ±5% 

Commercial 
37-40 MHz 

0°C to +90°C 
(Case) 

ov 

5.0 ±5% 


2873 tbl 07 


OUTPUT LOADING FOR AC TESTING 



To Device 
Under Test 


2873 drw 16 


Signal 

Cl 

IRd, IWr, DRd, DWr 

50pf 

All Others 

25pf 


Symbol 

Parameter 

Min. 

Max. 

Unit 

VlH 

Input HIGH Voltage 

3.0 

— 

V 

VlL 

Input LOW Voltage 

— 

0.4 

V 

VlHS 

Input HIGH Voltage 

3.5 

— 

V 

VlLS 

Input LOW Voltage 

_ 

0.4 

V 


2873 tbl 06 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS 


COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ±5%) 



Parameter 

Test Conditions 

16.67MHz 

20.0MHz 

25.0MHz 

33.33MHz 

Unit 

Min. 


Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

— 

3.5 

— 

a 

VOL 

Output LOW Voltage 

Vcc = Min., Iol = 4mA 

— 

ED 

— 

0.4 

— 

0.4 

— 

0.4 

D 

VOHT 

Output HIGH Voltage* 4 ’ 7 ' 

Vcc = Min., Ioh = -8mA 

m 

— 

EM 

— 

EM 

— 

EM 

— 

a 

Vohc 

Output HIGH Voltage* 8 ) 

Vcc = Min., Ioh = -4mA 

4.0 

— 

4.0 

— 

4.0 

— 

4.0 

— 

a 

Volt 

Output LOW Voltage* 4 ' 7 ) 

Vcc = Min., Iol = 8mA 

— 

0.8 

— 

0.8 

— 

0.8 

— 

0.8 

D 

Vih 

Input HIGH Voltage* 5 ) 


2.0 

— 

2.0 

— 

2.0 

— 

2.0 

— 


Vil 

Input LOW Voltage 


— 

0.8 

— 

0.8 

— 

0.8 

— 

0.8 

a 

Vihs 

Input HIGH Voltage* 2 ' 5 ) 


3.0 

— 

3.0 

— 

3.0 

— 

3.0 

— 

a 

Vils 

Input LOW Voltage* 1,2 ) 


— 

0.4 

— 

0.4 

— 

0.4 

— 

0.4 

D 

Ireset 

Input HIGH Current* 6 ) 


10 

100 

10 

100 

10 

100 

10 

100 

3 

ClN 

Input Capacitance* 7 ) 


— 

10 

— 

10 

— 

10 

— 

10 

O 

Cout 

Output Capacitance* 7 ) 


— 

10 

— 

10 

— 

10 

— 

10 

o 

Icc 

Operating Current 

Vcc = Max. 

— 

575 

— 

650 

— 

750 

— 

800 


IlH 

Input HIGH Leakage* 3 ) 

Vih = Vcc 

— 

100 

— 

100 

— 

100 

— 

100 

EQ| 

III 

Input LOW Leakage* 3 ' 

Vil = GND 

-100 

— 

-100 

— 

-100 

— 

-100 

— 

^3 

loz 

Output Tri-state Leakage 

Voh = 2.4V, VOL = 0.5V 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

m 


NOTES: 2873 tw oa 

1. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 Vo lts for larger periods. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp/Rd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. VoHTand Volt apply to the bidirectional data and tag buses only. Note that Vih and Vil also apply to these signals. Voht and Volt are supplies as additional 
information to help the system designer understand the relationship between current drive and output voltage on these pins.. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. The IDT79R3001 contains an internal pull-up/current source on the TAG pins to facilitate initialization. This current source is disconnected when Reset 
is inactive. 

7. Guaranteed by design . 

8. Vohc applies to RUN and Exception. 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V ±5%) 


Symbol 

Parameter 

Test Conditions 

37.0MHz 

40.0MHz 

Unit 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

TT 

a 

VOL 

Output LOW Voltage 

Vcc = Min., Iol = 4mA 

— 

0.4 

— 

•Co;*,. 

D 

Vohc 

Output HIGH Voltage* 7 ) 

Vcc = Min., Ioh = -4mA 

4.0 

— 

4-0 .| 



Voht 

Output HIGH Voltage* 4 - 6 ) 

Vcc = Min., Ioh = -8mA 

2.4 

— 

2.4 4::,, 

— 

m 

Volt 

Output LOW Voltage* 4 - 6 * 

Vcc = Min., Iol = 8mA 

— 

0.8 


0.8 

D 

Vih 

Input HIGH Voltage* 5 ) 


2.0 

— 

\ 2.0 • 

— 

D 

Vil 

Input LOW Voltage* 1 ) 


— 

0.8 4 | 

■■■ ^ 

0.8 

a 

Vihs 

Input HIGH Voltage* 2 ' 5 ) 


3.0 

— 

' 3.0 

— 

a 

Vils 

Input LOW Voltage* 1 ' 2 ) 


— 

-0.4 

— 

-0.4 

D 

Ireset 

Input HIGH Current* 6 ) 


10 

. 4yq%' :r 

10 

100 


Cin 

Input Capacitance* 6 ) 


— 4 

. : ,;to 

— 

10 

m 

Cout 

Output Capacitance* 6 ) 


— 1% <; . 

W io 

— 

10 

o 

Icc 

Operating Current 

Vcc = 5V, Ta = 70°C 

-4 ., 

825 

— 

850 


IlH 

Input HIGH Leakage* 3 ) 

Vih = VCC 


100 

— 

100 

in 

IlL 

Input LOW Leakage* 3 ) 

Vil = GND 

-100 

— 

-100 

— 

m 

102 

Output Tri-state Leakage 

Voh = VCC, Vol = GND 

-100 

100 

-100 

100 

Bi 


NOTES: 2B73 tbi 09 

1. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag buses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS 


MILITARY TEMPERATURE RANGE (Tc = -55°Cto +125°C, Vcc = +5.0V±10%) 


mm 



16.67MHz 

20.0MHz 

25.0MHz 

33.33MHz 

mm 


Parameter 

Test Conditions 

Min. 


Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Unit 

VOH 

Output HIGH Voltage 

Vcc= Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

— 

3.5 

, — 

a 

VOL 

Output LOW Voltage 

Vcc = Min., Iol = 4mA 

— 

0.4 

— 

0.4 

— 

0.4 

— y;*: 

||0. 4 

D 

Voht 

Output HIGH Voltage< 4 ’ 7 ) 

Vcc = Min., Ioh = -8mA 

m 

— 

ESI 

— 

ESI 

— 

2.4 


a 

Vohc 

Output HIGH Voltage! 8 ) 

Vcc = Min., Ioh = -4mA 

4.0 

— 

4.0 

— 

4.0 


US 

_ 

D 

Volt 

Output LOW Voltage! 4 ’ 7 ) 

Vcc = Min., Iol = 8mA 

— 

0.8 

— 

0.8 

— 

0.8 

!— 

0.8 

D 

Vih 

Input HIGH Voltage! 5 ) 


2.0 

— 


— 

2-Ql! 

— y 

2.0 

— 

a 

Vil 

Input LOW Voltage 


— 

0.8 

— 

0.8 


f 0.8 

— 

0.8 

a 

Vihs 

Input HIGH Voltage! 2 ' 5 ) 


3.0 

— 

3.0 


3.0 

— 

3.0 

— 

m 

Vils 

Input LOW Voltage! 1 ’ 2 ) 


— 

0.4 

— 

EB 

if _ 

0.4 

— 

0.4 

a 

Ireset 

Input HIGH Current* 6 ) 


10 

100 

10, 

100 

10 

100 

10 

100 

E3 

ClN 

Input Capacitance! 7 ) 


— 

10 

#— X:; ! 

Pi o 

— 

10 

— 

10 

O 

COUT 

Output Capacitance* 7 ) 


— 

10 

fir' 

10 

— 

10 

— 

10 

EB 

Icc 

Operating Current 

Vcc = Max. 

— . 


P : - — 

600 

— 

650 

— 

750 


IlH 

Input HIGH Leakage* 3 ) 

Vih = Vcc 


100 

— 

100 

— 

100 

— 

100 

13 

IlL 

Input LOW Leakage* 3 ) 

Vil = GND 

-100 

* — 

-100 

— 

-100 

— 

-100 

— 

13 

loz 

Output Tri-state Leakage 

VOH = 2.4V, VOL = 0.5V 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

13 


NOTES: 2873 tbl 08 

1 . Vil Min. = -3.0V for pulse width less than 1 5ns. Vil should not fall below -0.5 Volts for larger periods. 

2. Vihs and Vils apply to C!k2xSys, Clk2xSmp/Rd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag buses only. Note that Vih and ViLalso apply to these signals. Voht and Volt are supplies as additional 
information to help the system designer understand the relationship between current drive and output voltage on these pins.. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. The IDT79R3001 contains an internal pull-up/current source on the TAG pins to facilitate initialization. This current source is disconnected when Reset 
is inactive. 

7. Guaranteed by design . 

8. Vohc applies to RUN and Exception. 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS * 1 < 4 > 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ±5%) 


Symbol 

Parameter 

Clock 

TCkHigh 

Input Clock High* 2 ) 

TCkLow 

Input Clock Low® 

TCkP 

Input Clock Period® 


Clk2xSys to Clk2xSmp/Rd® 

Clk2xSmp/Rd to Clk2xPhi® 


Test Conditions 


16.67MHz 


Min. Max, 


20.0MHz 




25.0MHz 


Max. 


Run Operation 


TDEn 

Data Enable® 

TDDIs 

Data Disable* 3 ) 

TDVal 

Data Valid 

TWrDly 

Write Delay 

Tds 

Data Set-up 

Tdh 

Data Hold 

Tcbs 

CpBusy Set-up 

Tcbh 

CpBusy Hold 

TAcTy 

Access Type (1:0) 

TAT2 

Access Type2 

TMWr 

Memory Write 

TExc 

Exception 



Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


Stall Operation 


TSAVal 

Address Valid 

Load= 25pF 

— 

30 

— 

23 

— 

20 

— 

15 

ns 

TSAcTy 

Access Type 

Load= 25pF 

— 

27 

— 

23 

— 

18 

— 

13.5 

ns 

TMRdi 

Memory Read Initiate 

Load= 25pF 

1 

27 

1 

23 

1 

18 

1 

13.5 

ns 

TMRdT 

Memory Read Terminate 

Load= 25pF 

1 

2 

1 

23 

1 

5 

1 

13.5 

ns 

Tsti 

Run Terminate 

Load= 25pF 

3 

17 

3 

15 

3 

10 

2 

El 

ns 

TRun 

Run Initiate 

Load= 25pF 

— 

7 

— 

6 

— 

4 

— 

3 

ns 

TSMWr 

Memory Write 

Load= 25pF 

3 

27 

3 

23 

3 

18 

2 

9.5 

ns 

TSEc 

Exception Valid 

Load= 25pF 

— 

15 

— 

13 

— 

10 

— 

El 

ns 

TDMADis 

DMA Drive On 

Load= 25pF 

3 

15 

3 

15 

3 

15 

3 

15 

ns 

TDMAEn 

DMA Drive Off 

Load= 25pF 

— 

10 

— 

10 

— 

10 

— 

10 

ns 


Reset Initialization 


Trst 

Reset Pulse Width 

Trsttag 

Reset Pulse Width. Pull-downs 
on Tag 




Capacitive Load Deration 


Cld Load Derate® 


NOTES: 

1 . All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all three 2xClocks: Clk2xSys, Clk2xSmp/Rd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters are illustrated in detail in the "IDT79R3001 Hardware Interface Guide'. 

5. Tcyc is one CPU clock cycl e (2 cycles of a 2x dock). 

6. With the exception of Run , no two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Transition time <2. 5ns for 33MHz; <5ns for lower speeds. 


0.5 

1 

0.5 



ns/25pF 


2873 tbl 11 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS MILITARY AND COMMERCIAL TEMPERATURE RANGES 

AC ELECTRICAL CHARACTERISTICS^ 1 - 4 ) 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V ±5%) 



5 


NOTES: 2873 tbl 12 

1 . All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all three 2xClocks: Clk2xSys, Clk2xSmp/Rd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters are illustrated in detail in the "IDT79R3001 Hardware Interface Guide". 

5. Tcyc is one CPU clock cycl e (2 cydes of a 2x dock). 

6. With the exception of Run , no two signals on a given device will derate for a given load by a difference greater than 1 5%. 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS* 1 - 4 > 

MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V ±10%) 


Symbol 

Parameter 

Clock 

TCkHigh 

Input Clock High* 2 ) 

TCkLow 

Input Clock Low® 

TCkP 

Input Clock Period® 


Clk2xSys to Clk2xSmp/Rd® 

Clk2xSmp/Rd to Clk2xPhi® 


Test Conditions 


16.67MHz 


Min. Max, 


20.0MHz 




25.0MHz 


Max. 


Run Operation 


TDEn 

Data Enable® 

TDDIs 

Data Disable® 

TDVal 

Data Valid 

TWrOly 

Write Delay 

Tds 

Data Set-up 

Tdh 

Data Hold 

Tcbs 

CpBusy Set-up 

Tcbh 

CpBusy Hold 

TAcTy 

Access Type (1:0) 

TAT2 

Access Type2 

TMWr 

Memory Write 

TExc 

Exception 



Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


Stall Operation 


TSAVal 

Address Valid 

Load= 25pF 

TSAcTy 

Access Type 

Load= 25pF 

TMRdi 

Memory Read Initiate 

Load= 25pF 

TMRdT 

Memory Read Terminate 

Load= 25pF 

Tsti 

Run Terminate 

Load= 25pF 

TRun 

Run Initiate 

Load= 25pF 

TSMWr 

Memory Write 

Load= 25pF 

TSEc 

Exception Valid 

Load= 25pF 

TDMADis 

DMA Drive On 

Load= 25pF 

TDMAEn 

DMA Drive Off 

Load= 25pF 


Reset Initialization 


Trst 

Reset Pulse Width 

Trsttag 

Reset Pulse Width. Pull-downs 
on Tag 




Capacitive Load Deration 


Cld Load Derate® 


NOTES: 

1 . All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all three 2xClocks: Clk2xSys, Clk2xSmp/Rd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters are illustrated in detail in the "IDT79R3001 Hardware Interface Guide”. 

5. Tcyc is one CPU clock cycl e (2 cycles of a 2x dock). 

6. With the exception of Run , no two signals on a given device will derate for a given load by a difference greater than 1 5%. 

7. Transition time <2.5ns for 33MHz; <5ns for lower speeds. 


ns/25pF 


2873 tbl 13 
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IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN CONFIGURATIONS 

172-Pin Ceramic Flatpack (Cavity Side View) 



un ^ Oi-cM° ,-w tO'J n ' rin °o o o 

Wr-Q.CMT-T-t-^r-r-T-Q.raa30l’-®r-T-T-^^^»-i-^^^PT-(7)C0(DU)'fC0Q : h'(MT-O O O 

rtrflrtrtQooflQooo^QQooD 01 - 1 ^ £5£3i3f3i3i3f3i2i35 £2 £3 -d -d 
(5SflfO(fl(Ofl(5«(flrt(ii2Qn(flZoQQZZZQQZZZ(Dcfl[0(0(S(3(0(0(Brafl(3r3'DT3 
QQQQQQQQQQQQ0>>Q0>>>C300>>00Ca:QQQQQQQQQ0QQ<< 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 



44 86 


Data21 1 

43 87 

— EJ AdrLo2 

Data22 EE 


EEI AdrLo3 

Data24 1 


1 AdrLo4 

Data25 1 


1 AdrLo5 

Data26 1 


— 1 AdrLo 6 

Data31 1 — 


EE AdrLo7 

DataP3 1 


1 AdrLo 8 

Data27 1 


1 AdrLo9 

Data28 1 


1 AdrLolO 

XEn CE 


1 AdrLoll 

Data29 1 — 


EE AdrLo12 

Data30 1 


1 AdrLo13 

Exc (EE 


EE AdrLo14 

Clk2xPhi t= 


EEI VCC15 

GND7 CE 


EE VCC16 

GND 6 CE 


EE VCC17 

CpCond2 1—11 


EE GND16 

VCC7 CE 


EE GND17 

VCCG CE 


EE VCC18 

GND5 CE 


EE VCC19 

GND4 CE 

IDT79R3001 RISController 

EE GND18 

GND3 CE 



VCC5 CE 



VCC4 CE 


=Es:3 

VCC3 CE 


— 3 AdrLo15 

GND2 CE 


1 CpCondO 

GND1 CE 


1 CpCondl 

Clk2xSys Cl 


— 1 Resvdl 

CpSync 1 


EE GND19 

MemWr 1 


EE GND20 

AccTyl CE 


EE AdrLo16 

Run 1 — 


ED AdrLo17 

VCC2 Ed 


1 Into 

VCCt 1= 


EE Inti 

2xSmp/Rd EH 


EE Int2 

SysOut 1 — - 


1 Int3 

DCIk EE 


1 Int4 

ICIk EE 


EE Int5 

CpCond3 1 — 


1 CpBusy 

MemRd 1 


1 WrBusy 

AccTyO 1 


1 RdBusy 

AcctTy2 1 


1 Bus Error 

DMAStall L — 

1 129 

1 Reset 


172 130 



UUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUGUUffffUUmU 




CO CM ■»— 0005 r- o N to Cl CO N CM ID tTCDCD Lf) CO CM IfJ -O' CO M'CO(\jt-OOCONCDID'TCO 


NOTE: 

1 . AccTyp2 is redefined to be Parity Error if the parity enable option is selected at device initialization. 
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IDT79R3O01 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN CONFIGURATIONS (Continued) 
144-Pin PGA (Top View) 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 



AdrLo 

6 

AdrLo 

10 

AdrLo 

11 


AdrLo 

14 

AdrLo 

15 

CpCond 

0 

AdrLo 

16 

AdrLo 

17 






A 

VCC14 

VC Cl 2 

lnt(2) 

lnt(5) 

Wr 

Busy 

Reset 

VCC10 

B 

AdrLo 
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NOTE: 

1 . AccTyp2 is redefined to be Parity Error if the parity enable option is selected at device initialization. 
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Figure 16. Input Clock Timing 



* These signals are not actually output from the processor. They are drawn to provide 
a reference for other timing diagrams. 



Figure 17. Processor Reference Clock Timing 
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Figure 18. Synchronous Memory (Cache) Timing 


5.2 


26 














IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 



RUN 

STALL 

STALL 

FIXUP 

RUN 

Phase 

1 

2 

1 

2 

1 

2 

1 

2 

1 



Figure 19. Memory Write Timing 
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Figure 20. Memory Read Timing 
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Figure 21. Co-Processor Load/Store Timing 
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Figure 22. Interrupt Timing 
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Figure 23. Mode Vector Initialization 


NOTES: 

1 . Reset must be n egated synchronously; however, it can be asserted asynchronously. Designs must not rely on the proper functioning of SysOut prior to 
th e assertion of R eset. 

2. If Phase-Lock On or is asserted as mode select options, they should be asserted throughout the Reset period, to insure that the slowest coprocessor in 
the sy stem has sufficient time to lock to the CPU clocks. 

3. Reset is actually sampled in both Phase 1 and Phase 2. To insure proper initialization, it must be negated relative to the end of Phase 1 . 


5.2 


30 










IDT79R3001 

RISController FOR HIGH-PERFORMANCE EMBEDDED SYSTEMS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


Phase 

SysOut 

PhiOut 

DMA Stall 

Run 

DRd 

DWr 

IRd 

IWr 

XEn 

ICIk 

DCIk 

AdrLo 



Figure 24. Entering DMA Stall 
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RISC FLOATING POINT 

IDT79R3010A 

ACCELERATOR (FPA) 

IDT79R3010AE 



Integrated Device Technology, Inc. 


FEATURES: 

• Hardware Support of Single- and Double-Precision 
Operations: 

— Floating-Point Add 

— Floating-Point Subtract 

— Floating-Point Multiply 

— Floating-Point Divide 

— Floating-Point Comparisons 

— Floating-Point Conversions 

• Sustained performance: 

— 1 1 MFLOPS single precision LINPACK 

— 7.3 MFLOPS double precision LINPACK 

• 1 6.7MHz through 40 MHz operation 

• Direct, high-speed interface with IDT79R3000A and 
IDT79R3001 Processor 

• Supports Full Conformance With IEEE 754-1985 Floating- 
PointSpecification 

• Full 64-bit operation using sixteen 64-bit data registers 

• High-speed CEMOS™ technology 

• Military product compliant to MIL-STD-883, Class B 

• 32-bit status/control register providing access to all IEEE- 
Standard exception handling 


• Load/store architecture allows data movement directly 
between FPA and memory or between CPU and FPA 

• Overlapped operation of independent floating point ALUs 

• Fully pin-compatible with IDT79R3010/IDT79R3010L 

DESCRIPTION: 

The IDT79R3010A Floating-Point Accelerator (FPA) op- 
erates in conjunction with the IDT79R3000A Processor and 
extends the IDT79R3000A’s instruction set to perform arith- 
metic operations on values in floating-point representations. 
The IDT79R3010A FPA, with associated system software, 
fully conforms to the requirements of ANSI/IEEE Standard 
754-1985, “IEEE Standard for Binary Floating-Point Arith- 
metic.” In addition, the architecture fully supports the standard’s 
recomendations. 

This data sheet provides an overview of the features and 
architecture of the 79R301 0A FPA. A more detailed descrip- 
tion of the operation of the device is incorporated in the 
“R3000A Family Hardware User’s Manual,” and a more de- 
tailed architectural overview is provided in the “mips RISC 
Architecture" book, both available from IDT. 



Figure 1. IDT79R3010A Functional Block Diagram 


CEMOS is a trademark ol Integrated Device Technology. Inc. 

MILITARY AND COMMERCIAL TEMPERATURE RANGES DECEMBER 1990 


© 1990 Integrated Device Technology, Inc. 
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IDT79R3010A FPA REGISTERS 

The IDT79R3010A FPA provides 32 general purpose 32- 
bit registers, a Control/Status register, and a Revision Iden- 


General Purpose Registers 
(FGR/FPR) 

63 32 31 0 


FGR1 

FGRO 

FGR3 

FGR2 

FGR5 

FGR4 


ft 

• 


» 

FGR27 

FGR26 

FGR29 

FGR28 

FGR31 

FGR30 


tification register. The tightly-coupled coprocessor interface 
causes the register resources of the FPA to appear to the 
systems programmers as an extension of the CPU internal 
registers. The FPA registers are shown in Figure 2. 


Control/Status Register 

31 0 

I Exceptions/Enables/Modes I 


Implementation/Revision 
31 Register 0 
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Figure 2. IDT79R3010A FPA Registers 


Floating-point coprocessor operations reference three types 
of registers: 

• Floating-Point Control Registers (FCR) 

• Floating-Point General Registers (FGR) 

• Floating-Point Registers (FPR) 

Floating-Point General Registers (FGR) 

There are 32 Floating-Point General Registers (FGR) on 
the FPA. They represent directly-addressable 32-bit registers, 
and can be accessed by Load, Store, or Move Operations. 

Floating-Point Registers (FPR) 

The 32 FGRs described in the preceding paragraph are 
also used to form sixteen 64-bit Floating-Point Registers 
(FPR). Pairs of general registers (FGRs), for example FGRO 
and FGR1 (referto Figure 2) are physically combined to form 
a single 64-bit FPR. The FPRs hold a value in either single- 
or double-precision floating-point format. Double-precision 
format FPRs are formed from two adjacent FGRs. 

Floating-Point Control Registers (FCR) 

There are 2 Floating-Point Control Registers (FCR) on the 
FPA. They can be accessed only by Move operations and 
include the following: 

• Control/Status register, used to control and monitor 
exceptions, operating modes, and rounding modes; 

• Revision register, containing revision information about 
the FPA. 

COPROCESSOR OPERATION 

TheFPAcontinuallymonitorsthelDT79R3000Aprocessor 
instruction stream. If an instruction does not apply to the 
coprocessor, it is ignored; if an instruction does apply to the 
coprocessor, the FPA executes that instruction and transfers 
necessary result and exception data synchronously to the 
IDT79R3000A main processor. 


The FPA performs three types of operations: 

• Loads and Stores; 

• Moves; 

• Two- and three-register floating-point operations. 

Load, Store, and Move Operations 

Load, Store, and Move operations move data between 
memory or the IDT79R3000A Processor registers and the 
IDT79R3010A FPA registers. These operations perform no 
format conversions and cause no floating-point exceptions. 
Load, Store, and Move operations reference a single 32-bit 
word of either the Floating-Point General Registers (FGR) or 
the Floating-Point Control Registers (FCR). 

Floating-Point Operations 

The FPA supports the folowing single- and double-preci- 
sion format floating-point operations: 

• Add 

• Subtract 

• Multiply 

• Divide 

• Absolute Value 

• Move 

• Negate 

• Compare 

In addition, the FPA supports conversions between single- 
and double-precision floating-point formats and fixed-point 
formats. 

The FPA incorporates separate Add/Subtract, Multiply, 
and Divide units, each capable of independent and concurrent 
operation. Thus, to achieve very high performance, floating 
point divides can be overlapped with floating point multiplies 
and floating point additions. These floating point operations 
occur independently of the actions of the CPU, allowing 
furtheroverlapof integer and floating point operations. Figure 
3 illustrates an example of the types of overlap permissible. 
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DIV.S 







MUL.S 


STORE (SWC1) |g| 


NON FPU 


Only Load, Store, and Move operations 
are permitted in FPA during these cycles. 

Other FPA instructions can proceed during 
these cycles. However, two multiply or two 
divide operations cannot be overlapped. 

These cycles are free for integer operations 
in the CPU. 



MUL.S 
ADD.S 

STORE (SWC1) 

LOAD (LWC1) 
STORE (SWC1) 


NON FPU 

Figure 3. Examples of Overlapping Floating Point Operation 


Exceptions 

The IDT79R3010A FPA supports all five IEEE standard 
exceptions: 

• Invalid Operation 

• Inexact Operation 

• Division by Zero 

• Overflow 

• Underflow 

The FPA also supports the optional, Unimplemented Op- 
eration exception that allows unimplemented instructions to 
trap to software emulation routines. 

The FPA provides precise exception capability to the CPU; 
that is, the execution of a floating point operation which 
generates an exception causes that exception to occur at the 
CPU instruction which caused the operation. This precise 
exception capability is a requirement in applications and 
languages which provide a mechanism for local software 
exception handlers within software modules. 


OP 

Description 

1 

OP 

Description 


Load/Store/Move Instructions 



Computational Instructions 

LWC1 

Load Word to FPA 


ADD.fmt 

Floating-point Add 

SWC1 

Store Word from FPA 


SUB.fmt 

Floating-point Subtract 

MTC1 

Move Word to FPA 


MUL. fmt 

Floating-point Multiply 

MFC1 

Move Word from FPA 


DIV.fmt 

Floating-point Divide 

CTC1 

Move Control word to FPA 


ABS.fmt 

Floating-point Absolute value 

CFC1 

Move Control word from FPA 


MOV.fmt 

Floating-point Move 




NEG.fmt 

Floating-point Negate 


Conversion Instructions 



Compare Instructions 

CVT.S.fmt 

Floating-point Convert to Single FP 


C.cond.fmt 

Floating-point Compare 

CVT.D.fmt 

Floating-point Convert to Double FP 




CVT.W.fmt 

Floating-point Convert to fixed-point 





INSTRUCTION SET OVERVIEW 

All IDT79R301 OA instructions are 32 bits long and they can 

be divided into the folowing groups: 

• Load/Store and Move instructions move data between 
memory, the main processor and the FPA general 
registers. 

• Computational instructions perform arithmetic operations 
on floating point values in the FPA registers. 

• Conversion instructions perform conversion operations 
between the various data formats. 

• Compare instructions perform comparisons of the con- 
tents of registers and set a condition bit based on the 
results. The result of the compare operation is output on the 
FpCond output of the FPA, which is typically used as 
CpCondl on the CPU for use in coprocessor branch 
operations. 

Table 1 lists the instruction set of the IDT79R3010A FPA. 


Table 1. IDT79R3010A Instruction Summary 
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ID79R3010 PIPELINE ARCHITECTURE 

The IDT79R3010A FPA provides an instruction pipeline 
that parallels that of the IDT79R3000A processor. The FPA, 
however, has a 6-stage pipeline instead of the 5-stage pipeline 
of the IDT79R3000: the additional FPA pipe stage is used to 
provide efficient coordination of exception responses between 
the FPA and main processor. 

The execution of a single IDT79R3010A instruction con- 
sists of six primary steps: 

1) IF — Instruction Fetch. The main processor calculates the 
insruction address required to read an instruction from the 
1-Cache. No action is required of the FPA during this pipe 
stage since the main processor is responsible for address 
generation. 

2) RD — The instruction is present on the data bus during 
phase 1 of this pipe stage and the FPA decodes the 


instruction on the bus to determine if it is an instruction for 
the FPA. 

3) ALU — If the instruction is an FPA instruction, instruction 
execution commences during this pipe stage. 

4) MEM — If this is a coprocessor load or store instruction, the 
FPA presents or captures the data during phase 2 of this 
pipe stage. 

5) WB— The FPA uses this pipe stage solely to deal with 
exceptions. 

6) FWB — The FPA uses this stage to write back ALU results 
to its register file. This stage is the equivalent of the WB 
stage in the IDT79R3000A main processor. 

Each of these steps requires approximately one FPA cycle 
as shown in Figure 3 (parts of some operations spill over into 
another cycle while other operations require only 1/2 cycle). 
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Figure 4. IDT79R3010A Instruction Summary 
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Figure 5. IDT79R3010A Instruction Pipeline 
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The IDT79R3010A uses a 6-stage pipeline to achieve an 
instruction execution rate approaching one instruction per 
FPA cycle. Thus, execution of six instructions at a time are 
overlapped as shown in Figure 5. 


This pipeline operates efficiently because different FPA 
resources (address and data bus accesses, ALU operations, 
register accesses, and so on) are utilized on a non-interfering 
basis. 


5.3 


4 




IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PACKAGE THERMAL SPECIFICATIONS 

The IDT79R3010A utilizes special packaging techniques 
to improve both the thermal and electrical characteristics of 
the floating point accelerator. 

In order to improve the electrical characteristics of the 
device, the package is constructed using multiple signal 
planes, including individual power planes and ground planes 
to reduce noise associated with high-frequency TTL parts. 

In order to improve the thermal characteristics of the 
floating point accelerator, the device is housed using cavity 
down packaging for the flatpack and the PGA (the J-bend 
CerQuad is cavity up). In addition, these packages incorpo- 
rate a copper-tungsten thermal slug designed to efficiently 
transfer heat from the die to the case of the package, and thus 
effectively lower the thermal resistance of the package. The 
use of an additional external heat sink affixed to the package 
thermal slug further decreases the effective thermal resis- 
tance of the package. 

The case temperature may be measured in any environ- 
ment to determine whether the device is within the specified 
operating range. The case temperature should be measured 


at the center of the top surface opposite the package cavity 
(the package cavity is the side where the package lid is 
mounted). 

The equivalent allowable ambient temperature, Ta, can be 
calculated using the thermal resistance from case to ambient 
(0ca) for the given package. The following equation relates 
ambient and case temperature: 

Ta = Tc - P*0ca 

where Pis the maximum powerconsumption, calculated by 
using the maximum Icc from the DC Electrical Characteristic 
section. 


Typical values for 0ca at various airflows are shown in 
Table 2 for the various CPU packages. 
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Table 2. Thermal Resistance (Oca) at Various Airflows 
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NOTE: 

1 . Reserved pins must not be connected. 
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PIN CONFIGURATION* 1 * 

(Ceramic, Cavity Down) - BOTTOM VIEW 
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NOTE: 

1 . Reserved pins must not be connected. 



5.3 





IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN CONFIGURATION* 1 * 

84-L QUAD FLATPACK (CAVITY DOWN) 
TOP VIEW 
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NOTE: 

1. Reserved pins must not be connected. 
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PIN DESCRIPTIONS 


Pin Name 


Description 

Data (0-31) 

B3 

A multiplexed 32-bit bus used for instruction and data transfers on phase 1 and phase 2, respectively. 

DataP (0-3) 

o 

A 4-bit bus containing even parity over the data bus. Parity is generated by the FPA on stores. 

Run 

1 

Input to the FPA which indicates whether the processor-coprocessor system is in the run or stall state. 

Exception 

1 

Input to the FPA which indicates exception related status information. 

FpBusy 

□ 

Signal to the CPU indicating a request for a coprocessor busy stall. 

FpCond 

o 

Signal to the CPU indicating the result of the last comparision operation. 

Fplnt 

0 

Signal to the CPU indicating that a floating-point exception has occured for the current FPA instruction. 

Reset 

1 

Synchronous initialization input used to distinguish the processor-FPA synchronization period from the 
execution period. Reset must be synchronized by the leading edge of SysOutfrom the CPU. 

PiiOn 

i 

Input which during the reset period determines whether the phase lock mechanism is enabled and during the 
execution period determines the output timing model. 

FpPresent 

o 

Output which is pulled to ground through an impedance of approximately 0.5k ohms. By providing an external 
pullup on this line, an indication of the presence or absence of the FPA can be obtained. 

Clk2xSys 

1 

A double frequency clock input used for generating FpSysOut. 

Clk2xSmp 

n 

A double frequency clock input used to determine the sample point for data coming in to the FPA. 

Clk2xRd 

i 

A double frequency clock input used to determine the disable point for the data drivers. 

Clk2xPhi 

i 

A double frequency clock input used to determine the position of the internal phases, phase 1 and phase 2. 

FpSysOut 

o 

Synchronization clock from the FPA. 

FpSysIn 

1 

Input used to receive the synchronization clock from the FPA. 

FpSync 

1 

Input used to receive the synchronization clock from the CPU. 
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IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


ABSOLUTE MAXIMUM RATINGS 0 ’ 3) 


Symbol 

Rating 

Commercial 

Military 

Unit 

Vterm 

Terminal Voltage 
with Respect 
toGND 

-0,5 to +7.0 

-0.5 to +7.0 

1 

Ta, Tc 

Operating 

Temperature 

0 to +70«> 
(Ambient) 

0 to +90 (5 > 
(Case) 

-55 to +125 
(Case) 

1 

Tbias 

Case Temperature 
Under Bias 

-55 to+125< 4 > 
0 to +90< 5 > 

-65 to +135 

m 

Tstg 

Storage 

Temperature 

-55 to +125 

-65 to +155 

°c 

IlN 

Input Voltage 

-0.5 to +7.0 

-0.5 to +7.0 

V 


2873 tbl 04 


NOTE: 

1 . Stressesgreater than those listed under ABSOLUTE MAXIMUM RATI NGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. Vin minimum = -3.0V for pulse width less than 15ns. 

Vin should not exceed Vcc +0.5 Volts. 

3. Notmorethanoneoutputshouldbeshortedatatime. Duration of the short 
should not exceed 30 seconds. 

4. 16-33 MHz only. 

5. 37-40 MHz only. 


AC TEST CONDITIONS 


RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


Grade 

Temperature 

GND 

Vcc 

Military 

-55°C to +125°C 
(Case) 

OV 

5.0 ±10% 

Commercial 
16-33 MHz 

0°C to +70°C 
(Ambient) 

ov 

5.0 ±5% 

Commercial 
37-40 MHz 

0°C to +90°C 
(Case) 

OV 

5.0 ±5% 


2873 tbl 06 


OUTPUT LOADING FOR AC TESTING 



Symbol 

Parameter 

Min. 

Max. 

Unit 

VlH 

Input HIGH Voltage 

3.0 

— 

V 

VlL 

Input LOW Voltage 

— 

0.4 

V 

VlHS 

Input HIGH Voltage 

3.5 

— 

V 

VlLS 

Input LOW Voltage 

— 

0.4 

V 

VlHC 

Input HIGH Voltage 

4.0 

— 

V 

VlLC 

Input LOW Voltage 

— 

0.4 

V 


2873 tbl 05 
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IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010A 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to + 70°C, Vcc = + 5.0 V ± 5%) 


Test Conditions 


Vcc = Min, Ioh = -4mA 


Vcc = Min, Iol = 4mA 


Vcc = Min, Iol = 1.5mA 



Parameter 

VOH 

Output HIGH Voltage 

VOL 

Output LOW Voltage 

VOLFP 

Output LOW Voltage* 5 ) 

Vih 

Input HIGH Voltage* 6 ) 

Vil 

Input LOW Voltage* 1 ) 

Vihs 

Input High Voltage! 2 ' 6 ) 

Vils 

Input LOW Voltage* 1 ' 2 > 

VlHC 

Input HIGH Voltage! 4 ' 6 ) 

Vilc 

Input LOW Voltage! 1 ' 4 ) 

Cin 

Input Capacitance! 7 ) 

Colit 

Output Capacitance! 7 ) 

Icc 

Operating Current 

IlH 

Input HIGH Leakage! 3 ) 

IlL 

Input LOW Leakage! 3 ) 

loz 

Output Tri-state Leakage 



Vcc = 5.0V, Ta = 70°C 


Vih = Vcc 


Vil = GND 


VOH = 2.4V, VOL = 0.5V 


DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to + 70°C, Vcc = + 5 .0 V ± 5%) 


Test Conditions 


Vcc = Min, lOH = -4mA 


Vcc = Min, Iol = 4mA 


Vcc = Min, Iol = 1.5mA 



Parameter 

VOH 

Output HIGH Voltage 

VOL 

Output LOW Voltage 

VOLFP 

Output LOW Voltage* 5 ) 

Vih 

Input HIGH Voltage* 6 ) 

Vil 

Input LOW Voltage! 1 ) 

Vihs 

Input High Voltage! 2 ' 6 ) 

Vils 

Input LOW Voltage! 1 ' 2 ) 

VlHC 

Input HIGH Voltage! 4 ' 6 ) 

Vilc 

Input LOW Voltage! 1 ' 4 ) 

Cin 

Input Capacitance! 7 ) 

Cout 

Output Capacitance* 7 ) 

Icc 

Operating Current 

IlH 

Input HIGH Leakage* 3 ) 

IlL 

Input LOW Leakage! 3 ) 

loz 

Output Tri-state Leakage 

NOTES: 



Vcc = 5.0V, Ta = 70°C 


Vih = Vcc 


Vil = GND 


VOH = 2.4V, VOL = 0.5V 


1. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0 .5V for larger period s. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, FpSysin, FpSync and Reset. 

3. These parameters do n ot apply to the c lock inputs . 

4. VIHC and Vilc apply t o Run, PilO n and Exception. 

5. VOLFP applies to the FPPresent pin only. 

6. Vih and Vihs should not be held above Vcc + 0.5 Volts. 

7. Guaranteed by design. 
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IDT79R3O10A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to + 90°C, Vcc = + 5.0 V ± 5%) 


HISS!! 

Parameter 

Test Conditions 

37 MHz 

40 MHz 

Unit 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 


3.5 

— 

3.5 

-si. 

n 

VOL 

Output LOW Voltage 

Vcc = Min, Iol = 4mA 

— 

0.4 

— 

0.4 

D 

VOLFP 

Output LOW Voltage< 5 ) 

Vcc = Min, Iol = 1.5mA 

— 

0.5 

— 

0.5 

D 

VlH 

Input HIGH Voltage! 6 ) 


2.0 

— 

2.0 

s : . — 

D 

VlL 

Input LOW Voltage! 1 ) 


— 

0.8 

— •. % 

' r 0.8 

D 

Vihs 

Input High Voltage! 2 ’ 6 ) 


3.0 

— 

3.0 .. . 

— 

D 

Vils 

Input LOW Voltage! 1 ’ 2 ) 


— 

O’ 4 

— 

0.4 

m 

VlHC 

Input HIGH Voltage! 4 ’ 6 ) 


4.0 

— 

- 4.0 

— 

D 

Vilc 

Input LOW Voltage! 1 ' 4 ) 


— 

0.4 

p _ 

0.4 


Cin 

Input Capacitance! 7 ) 


— 

4|:10 JP* 

— 

10 

O 

CO LIT 

Output Capacitance! 7 ) 


— 

::f 10 : 

— 

10 

□ 

ICC 

Operating Current 

Vcc = 5.0V, Tc = 90°C 

— J:|;: 

725 

— 

750 


IlH 

Input HIGH Leakage! 3 ) 

VlH = Vcc 


W loo 

— 

100 


IlL 

Input LOW Leakage! 3 ) 

Vil = GND 

-100 

— 

-1 00 

— 


loz 

Output Tri-state Leakage 

VoH = 2.4V, VOL = 0.5V 

-1 00 : 

100 

-100 

100 



1. Vii Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0 . 5V for la rger period s. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, FpSysin, FpSync and Reset. 

3. These parameters do n ot apply to the clock inputs . 

4. VIHC and Vilc apply to Run, PIIQ n and Exception. 

5. VOIFP applies to the FP Present pin only. 

6. Vih and Vihs should not be held above Vcc + 0.5 Volts. 

7. Guaranteed by design. 
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IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010A 



MILITARY TEMPERATURE RANGE (Tc =-55°C to + 125°C, Vcc = + 5.0 V± 10%) 


Hi 

Parameter 

Voh 

Output HIGH Voltage 

Vol 

Output LOW Voltage 

VOLFP 

Output LOW Voltage* 5 * 

VIH 

Input HIGH Voltage* 6 * 

VIL 

Input LOW Voltage* 1 ) 

Vihs 

Input High Voltage* 2 ' 6 ) 

Vils 

Input LOW Voltage* 1 ’ 2 ) 

VlHC 

Input HIGH Voltage* 4 ' 6 ) 

Vilc 

Input LOW Voltage* 1 ' 4 ) 

ClN 

Input Capacitance* 7 ) 

COUT 

Output Capacitance* 7 ) 

ICC 

Operating Current 

IlH 

Input HIGH Leakage* 3 ) 

IlL 

Input LOW Leakage* 3 ) 

loz 

Output Tri-state Leakage 



2873 tbl 10 


DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE — 

MILITARY TEMPERATURE RANGE (Tc =-55°C to + 125°C, Vcc = + 5.0 V± 10%) 


| 25.0 MHz 

Test Conditions 


Vcc = Min, lOH = -4mA 


Vcc = Min, Iol = 4mA 


Vcc = Min, Iol = 1.5mA 


Symbol 

Parameter 

VOH 

Output HIGH Voltage 

VOL 

Output LOW Voltage 

VOLFP 

Output LOW Voltage* 5 ) 

Vih 

Input HIGH Voltage* 6 ) 

Vil 

Input LOW Voltage* 1 ) 

Vihs 

Input High Voltage* 2 ' 6 ) 

Vils 

Input LOW Voltage* 1 ' 2 ) 

VlHC 

Input HIGH Voltage* 4 ' 6 ) 

Vilc 

Input LOW Voltage* 1 ’ 4 ) 

Cin 

Input Capacitance* 7 ) 

Cout 

Output Capacitance* 7 ) 

Icc 

Operating Current 

IlH 

Input HIGH Leakage* 3 ) 

IlL 

Input LOW Leakage* 3 ) 

loz 

Output Tri-state Leakage 




VCC = 5.0V, Ta = 70°C 


Vih = Vcc 


Vil = GND 


Voh = 2.4V, VOL = 0.5V 



1 . Vil Min. = -3.0V for pulse width less than 1 5ns. Vil should not fall below -0 . 5V for la rger period s. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, FpSysin, FpSync and Reset. 

3. These parameters do n ot apply to the c lock inputs . 

4. VlHC and Vilc apply to Run, PIIQ n and Exception. 

5. VOLFP applies to the FP Present pin only. 

6. Vih and Vihs should not be held above Vcc + 0.5 Volts. 

7. Guaranteed by design. 
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IDT79R3O10A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS FOR IDTyQRSOlOA^ 1 - 3 > 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ± 5%) 


Symbol 

Parameter 

Test Conditions 

16.67 MHz 

20.0 MHz 

Unit 

Min. 

Max. 

Min. 

Max. 

Clock | 

TCkHigh 

Input Clock High* 2 * 

Note 7 

12 

— 

10 

— 

ns 

TCkLow 

Input Clock Low* 2 ) 

Note 7 

12 

— 

10 

— 

ns 

TCkP 

Input Clock Period 


30 

1000 

25 

1000 

mm 


Clk2xSys to Clk2XSmp* 5 ) 


0 

tcyc/4 

0 

tcyc/4 

■s 


Clk2xSmp to Clk2xRd< 5 ) 


0 

tcyc/4 

0 

tcyc/4 

ns 


Clk2xSmp to Clk2xPhi* 5 ) 


9 

tcyc/4 

7 

tcyc/4 

ns 

Timing Paramters 

TDEn 

Data Enable* 3 ) 


— 

-2 

— 

-2 

ns 

TDDIs 

Data Disable* 3 ) 


— 

-1 

— 

-1 

ns 

TDVal 

Data Valid 

Load= 25pF 

— 

3 

— 

3 

ns 

Trsds 

Reset Set-up 


15 

— 

15 

— 

ns 

Tds 

Data Set-up 


9 

— 

8 

— 

ns 

Tdh 

Data Hold* 3 ) 


-2.5 

— 

-2.5 

— 

ns 

TFpCond 

Fp Condition 


— 

35 

— 

30 

ns 

TFpBusy 

Fp Busy 


— 

15 

— 

13 

ns 

TFpInt 

Fp Interrupt 


— 

40 

— 

35 

ns 

TFpMov 

Fp Move To 


— 

35 

— 

30 

ns 

TRExS 

Exception Set-up (Run Cycle) 


14 

— 

12 

— 

ns 

TSExS 

Exception Set-up (Stall Cycle) 


12 

— 

10 

— 

ns 

TExH 

Exception Hold 


0 

— 

0 

— 

ns 

TRunS 

Run Set-up 


17 

— 

15 

— 

ns 

TRunH 

Run Hold 


-2 

— 

-2 

— 

ns 

TstallS 

Stall Set-up 


10 

— 

10 

— 

ns 

TStallH 

Stall Hold 


-2 

— 

-2 



ns 

Reset Initialization 

TrstPLL 

Reset Timing, Phase-lock on* 4 ' 5) 


3000 

— 

3000 

— 

Tcyc 

Trst 

Reset Timing, Phase-lock off* 5 ) 


128 

— 

128 

— 

Tcyc 

Capacitive Load Deration 

CLD 

Load Derate* 6 ) 


0.5 

2 

0.5 

1 

ns/25pF 


1 . All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This param eter is gu arantee d by design. 

4. With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

6. No two signals on a given device will derate for a given load by a difference greater than 1 5%. 

7. Clock transition time < 5ns. 
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IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS FOR IDTTSRSOIOAE* 1 - 3 > 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ± 5%) 


Symbol 

Parameter 

Test Conditions 

25.0 MHz 

33.33 MHz 

Unit 

Min. 

Max. 

Min. 

Max. 

Clock 

TCkHigh 

Input Clock High* 2 ) 

Note 7 

8 

— 

6 

— 

ns 

TCkLow 

Input Clock Low* 2 ) 

Note 7 

8 

— 

6 

— 

ns 

TCkP 

Input Clock Period 


20 

1000 

15 

1000 

mm 


Clk2xSys to Clk2XSmp* 5 ) 


0 

tcyc/4 

0 

tcyc/4 

■■ 


Clk2xSmp to Clk2xRd* 5 > 


0 

tcyc/4 

0 

tcyc/4 

ns 


Clk2xSmo to Clk2xPhi* 5 > 


5 

tcyc/4 

3.5 

tcyc/4 

ns 

Timing Paramters 

TDEn 

Data Enable* 3 ) 


— 

-1.5 

— 

-1 

ns 

TDDIs 

Data Disable* 3 ) 


— 

-0.5 

— 

-0.5 

ns 

TDVal 

Data Valid 

Load= 25pF 

— 

2 

— 

2 

ns 

Trsds 

Reset Set-up 


10 

— 

10 

— 

ns 

Tds 

Data Set-up 


6 

— 

4.5 

— 

ns 

Tdh 

Data Hold* 3 ) 


-2.5 

— 

-2.5 

— 

ns 

TFpCond 

Fp Condition 


— 

25 

— 

17 

ns 

TFpBusy 

Fp Busy 


— 

10 

— 

7 

ns 

TFpInt 

Fp Interrupt 


— 

25 

— 

18 

ns 

TFpMov 

Fp Move To 


— 

25 

— 

16 

ns 

TRExS 

Exception Set-up (Run Cycle) 


11 

— 

9 

— 

ns 

TSExS 

Exception Set-up (Stall Cycle) 


8 

— 

6.5 

— 

ns 

TExH 

Exception Hold 


0 

— 

0 

— 

ns 

TRunS 

Run Set-up 


15 

— 

12.5 

— 

ns 

TRunH 

Run Hold 


-2 

— 

-1.5 

— 

ns 

T Stalls 

Stall Set-up 


9 

— 

7 

— 

ns 

TStallH 

Stall Hold 


-2 

— 

-2 

— • 

ns 

Reset Initialization 

TrstPLL 

Reset Timing, Phase-lock on* 4 ’ 5 > 


3000 

— 

3000 

— 

Tcyc 

Trst 

Reset Timing, Phase-lock off* 5 ) 


128 

— 

128 

— 

Tcyc 

Capacitive Load Deration 

CLD 

Load Derate* 6 ) 


0.5 

1 

0.5 

1 

ns/25pF 


1 . All timings are referenced to 1 .5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This param eter is gu arantee d by design. 

4. With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

6. No two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns for 33MHz; clock transition time < 5ns for all other speeds. 
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IDT79R3O10A/AE 

RISC FLOATING POINT ACCELERATOR MILITARY AND COMMERCIAL TEMPERATURE RANGES 

AC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE^> 3 ) 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V ± 5%) 



NOTES: 

1. All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Cik2xRd, and Clk2xPhi. 

3. This param eter is gu arantee d by design. 

4. With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cyde (two cycles of a 2x clock). 


6. No two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns. 
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IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS FOR IDT79R3010A^ 1,3 ) 
MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V ± 10%) 


Symbol 

Parameter 

| Clock 

TCkHigh 

Input Clock High* 2 ' 

TCkLow 

Input Clock Low* 2 ' 

TCkP 

Input Clock Period 

Clk2xSys to Clk2XSmp* 5 ' 
Clk2xSmp to Clk2xRd* 5 ' 
Clk2xSmo to Clk2xPhi* 5 ' 


16.67 MHz 


20.0 MHz 


Test Conditions 


Timing Paramters 


TDEn 

Data Enable* 3 ' 

TDDIs 

Data Disable* 3 ' 

TDVal 

Data Valid 

Trsds 

Reset Set-up 

Tds 

Data Set-up 

Tdh 

Data Hold* 3 ' 

TppCond 

Fp Condition 

TFpBusy 

Fp Busy 

TFpInt 

Fp Interrupt 

TFpMov 

Fp Move To 

TRExS 

Exception Set-up (Run Cycle) 

TSExS 

Exception Set-up (Stall Cycle) 

TExH 

Exception Hold 

TRunS 

Run Set-up 

TRunH 

Run Hold 

TStallS 

Stall Set-up 

TStallH 

Stall Hold 


Reset Initialization 


TrstPLL 

Reset Timing, Phase-lock on* 4 ' 5 ' 

Trst 

Reset Timing, Phase-lock off* 5 ' 

Capacitive Load Deration 

CLD 

Load Derate* 6 ' j 



NOTES: 

1. All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This param eter is gu arantee d by design. 

4. With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 


ns/25pF 
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6. No two signals on a given device will derate for a given load by a difference greater than 1 5%. 

7. Clock transition time < 5ns 









































































































































IDT79R3010A/AE 

RISC FLOATING POINT ACCELERATOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS FOR IDTyQRSOIOAE^ 1 - 3 > 
MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V± 10%) 


Symbol 

Parameter 

Test Conditions 

25.0 MHz 

33.33 MHz 

Unit 

Min. 

Max. 

Min. 

Max. 

Clock | 

TCkHigh 

Input Clock High! 2 ) 

Note 7 

8 

— 

6 


ns 

TCkLow 

Input Clock Low! 2 ) 

Note 7 

8 

— 

6 


ns 

TCkP 

Input Clock Period 


20 

1000 

15 

1000- 

mm 


Clk2xSys to Clk2XSmp! 5 ) 


0 

tcyc/4 

0 

tcyc/4 

■9 


Clk2xSmp to C!k2xRd! 5 ) 


0 

tcyc/4 

0 J 

ill tcyc/4 

ns 


Clk2xSmp to Clk2xPhi< 5 > 


5 

tcyc/4 

3.5 

' tcyc/4 

ns 

Timing Paramters 

UN: 

TDEn 

Data Enable! 3 ) 


— 

-1.5 

— it?®*? 

. "I 

ns 

TDDIs 

Data Disable! 3 ) 


— 

-0.5 

— 

-0.5 

ns 

TDVal 

Data Valid 

Load= 25pF 

— 

2 

— ... 

2 

ns 

Trsds 

Reset Set-up 


10 

— 

10 

— 

ns 

Tds 

Data Set-up 


6 

— 


— 

ns 

Tdh 

Data Hold< 3 ) 


-2.5 

— 

• : ::V : -.,-2.5 / 

— 

ns 

TFpCond 

Fp Condition 


— 

25 

— - 

17 

ns 

TFpBusy 

Fp Busy 


— 

10 

" 

7 

ns 

TFpInt 

Fp Interrupt 


— 

25 

' — 

18 

ns 

TFpMov 

Fp Move To 


— 

25 ^ 

— 

16 

ns 

TRExS 

Exception Set-up (Run Cycle) 


11 


9 

— 

ns 

TSExS 

Exception Set-up (Stall Cycle) 


8 

_ 

if 6.5 

— 

ns 

TExH 

Exception Hold 


0 

Jf 

0 

— 

ns 

TRunS 

Run Set-up 


15 

,>"v — . 

12.5 

— 

ns 

TRunH 

Run Hold 


-2 

• Jp-j?- 

-1.5 

— 

ns 

TStallS 

Stall Set-up 


9 


7 

— 

ns 

TStallH 

Stall Hold 


-2 


-2 

— 

ns 

| Reset Initialization 

TrstPLL 

Reset Timing, Phase-lock on! 4 ' 5) 


3000 : :*:^ 

• — 

3000 

— 

Tcyc 

Trst 

Reset Timing, Phase-lock off! 5 ) 


12|n 

— 

128 

— 

Tcyc 

| Capacitive Load Deration 

|CLD 

Load Derate! 6 ) 


0.5 

i 

0.5 


ns/25pF 


1. All timings are referenced to 1.5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, C!k2xRd, and Clk2xPhi. 

3. This param eter is gu arantee d by design. 

4. With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

6. No two signals on a given device will derate for a given load by a difference greater than 1 5%. 

7. Clock transition time < 2.5ns for 33MHz; clock transition time < 5ns for all other speeds. 
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Clk2xPhi 
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JsmR*, 
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Trd 
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Tsys 


X 


X 


X 
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Figure 6. Input “2x” Clock Timing 



Figure 7. Processor Reference Clock 

* These signals are not actually output from the floating point processor. 

They are drawn to provide a reference for other timing diagrams. 
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Phase 

FpSysOut 
FpPhiOut 

Data Bus 

Run 

Figure 8. Floating Point Load/Store Timing 
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Figure 10. Floating Point Interrupt Timing 



! FPCompareALU 

I. FPCompareMEM J 
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Figure 11. Floating Point Condition Timing 
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Figure 13. Power-On Reset Timing 
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ORDERING INFORMATION 


IDT XXXXX 
Device Type 


XX X X 

Speed Package Process/ 

I | Temperature 

Range 


Blank 

B 


M 


F 

G 

QJ 


16 

20 

25 

33 

37 

40 


Commercial (0°C to +70°C) 

Military (-55°C to +125°C) 

Compliant to MIL-STD-883, Class B 
Military Temperature Range Only 


84-Pin Quad Flatpack (Cavity Down) 
84-Pin PGA (Cavity Down) 

84-Pin J-Bend CerPack (Cavity Up) 


16.67 MHz 

20.0 MHz 

25.0 MHz 
33.33 MHz 
37 MHz 
40 MHz 

Floating Point Accelerator 

Enhanced Timing Floating Point Accelerator 
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DESCRIPTION: 

The IDT 79R3500A RISC Microprocessor consists of three 
tightly-coupled processors integrated on a single chip. The 
first processor is a full 32-bit CPU based on RISC (Reduced 
Instruction Set Computer) principles to achieve a new stan- 
dard of microprocessor performance. The second processor 
is a system control coprocessor, called CPO, containing a 
f ully-associative 64 entry TLB (T ranslation Lookaside Buffer) , 
MMU (Memory Management Unit) and control registers, sup- 
porting a 4 Gigabyte virtual memory subsystem, and a Harvard 
Architecture Cache Controller achieving a bandwidth of 320 
Mbytes/second using industry standard static RAMs. The 
third processor is the Floating Point Accelerator which per- 
forms arithmetic operations on values in floating-point repre- 
sentations. This processor fully conforms to the requirements 
of ANSI/IEEE Standard 754-1 985, “IEEE Standard for Binary 
Floating-Point Arithmetic.” In addition, the architecture fully 
supports the standard’s recommendations. 

The programmer model of this device will be the same as 
the programmer model of a system which uses a discrete 
79R3000 with the 79R3010: 32 integer registers, 16 floating 
point registers; co-processor 0 registers; floating point status 
and control register; RISC integer ALU; Integer Multiply and 
Divide ALU; Floating Point Add/Subtract, Multiply, and Divide 
ALUs. The device pipeline will be the same asforthe 79R3000, 
as will the co-processor 0 functionality. No new instructions 
have been introduced. Pin compatibility extends to AC and DC 
characteristics, software execution and initialization mode 
vector selection. 

This data sheet provides an overview of the features and 
architecture of the 79R3500A CPU, Revision 3.0. A more 
detailed description of the operation of the device is incorpo- 
rated in the “K3500A Family Hardware User Manual", and a 
more detailed architectural overview is provided in the “mips 
RISC Architecture” book, both available from IDT. Documen- 
tation providing details of the software and development 
environments supporting this processor are also available 
from IDT. 

IDT79R3500A CPU Registers 

The IDT79R3500A CPU provides 32 general purpose 32- 
bit registers, a 32-bit Program Counter, and two 32-bit regis- 
ters that hold the results of integer multiply and divide opera- 
tions. Only two of the 32 general registers have a special 
purpose; register rO is hardwired to the value “0”, which is a 
useful constant, and register r31 is used as the link register in 
jump-and-link instructions (return address forsubroutine calls). 

The CPU registers are shown in Figure 2. Note that there 
is no Program Status Word (PSW) register shown in this 
figure: the functions traditionally provided by a PSW register 
are instead provided in the Status and Cause registers incor- 
porated within the System Control Coprocessor (CPO). 


FPA REGISTERS 

The IDT79R3010A FPA provides 32 general purpose 32- 
bit registers, a Control/Status register, and a Revision Iden- 
tification register. 

Floating-point coprocessoroperations reference three types 
of registers: 

• Floating-Point Control Registers (FCR) 

• Floating-Point General Registers (FGR) 

• Floating-Point Registers (FPR) 


General Purpose Registers 



Floating-Point General Registers (FGR) 

There are 32 Floating-Point General Registers (FGR) on 
the FPA. They represent directly-addressable 32-bit registers, 
and can be accessed by Load, Store, or Move Operations. 

Floating-Point Registers (FPR) 

The 32 FGRs described in the preceding paragraph are 
also used to form sixteen 64-bit Floating-Point Registers 
(FPR). Pairs of general registers (FGRs), for example FGRO 
and FGR1 (refer to Figure 3) are physically combined to form 
a single 64-bit FPR. The FPRs hold a value in either single- or 
double-precision floating-point format. Double-precision for- 
mat FPRs are formed from two adjacent FGRs. 

Floating-Point Control Registers (FCR) 

There are 2 Floating-Point Control Registers (FCR) on the 
FPA. They can be accessed only by Move operations and 
include the following: 

• Control/Status register, used to control and monitor excep- 
tions, operating modes, and rounding modes; 

• Revision register, containing revision information about 
the FPA. 
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General Purpose Registers 
(FGR/FPR) 

63 32 31 0 

Control/Status Register 

31 0 

Exceptions/Enables/Modes 


Implementation/Revision 
31 Register 0 
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Figure 3. FPA Registers 


FGR1 

FGRO 

FGR3 

FGR2 

FGR5 

FGR4 

• 

• 

• 

FGR27 

FGR26 

FGR29 

FGR28 

FGR31 

FGR30 


Instruction Set Overview 

All IDT79R3500A instructions are 32 bits long, and there 
are only three instruction formats. This approach simplifies 
instruction decoding, thus minimizing instruction execution 
time. The 79R3500A processor initiates a new instruction on 
every run cycle, and is able to complete an instruction on 
almost every clock cycle. The only exceptions are the Load 
instructions and Branch instructions, which each have a single 
cycle of latency associated with their execution. Note, how- 
ever, that in the majority of cases the compilers are able to fill 
these latency cycles with useful instructions which do not 
require the result of the previous instruction. This effectively 
eliminates these latency effects. 

The actual instruction set of the CPU was determined after 
extensive simulations to determine which instructions should 
be implemented in hardware, and which operations are best 
synthesized in software from other basic instructions. This 
methodology resulted in the R3500A having the highest 
performance of any available microprocessor. 


1-Type (Immediate) 


31 26 
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20 16 

15 


0 

op 

rs 

rt 

immediate 


J-Type (Jump) 
31 26 25 


op 


target 


R-Type (Register) 


31 26 

25 21 

20 16 

15 11 

10 6 

5 0 

op 

rs 

rt 

rd 

re 

funct 


Figure 4. IDT79R3500A Instruction Formats 


The IDT79R3500A instruction set can be divided into the 

following groups: 

• Load/Store instructions move data between memory and 
general registers. They are all l-type instructions, since the 
only addressing mode supported is base register plus 16- 
bit, signed immediate offset. 

The Load instruction has a single cycle of latency, which 
means that the data being loaded is not available to the 
instruction immediately afterthe load instruction. The com- 
piler will fill this delay slot with either an instruction which is 
not dependent on the loaded data, or with a NOP instruc- 
tion. There is no latency associated with the store instruc- 
tion. 

Loads and Stores can be performed on byte, half-word, 
word, or unaligned word data (32 bit data not aligned on a 
modulo-4 address). The CPU cache is constructed as a 
write-through cache. 

• Computational instructions perform arithmetic, logical 
and shift operations on values in registers. They occur in 
both R-type (both operands and the result are registers) 
and l-type (one operand is a 1 6-bit immediate) formats. FP 
computational instructions perform arithmetic operations 
on floating point values in the FPA registers. Note that 
computational instructions are three operand instructions; 
that is, the result of the operation can be stored into a 
different register than either of the two operands. This 
means that operands need not be overwritten by arithmetic 
operations. This results in a more efficient use of the large 
register set. 

• Conversion instructions perform conversion operations 
on the floating point values in the FPA registers. 

• Compare intructions perform comparisons of the contents 
of FPA registers and set a condition bit based on the 
results. The result of the compare operations is tied directly 
to Cp Cond (1) for software testing. 

• Jump and Branch instructions change the control flow of 
a program. Jumps are always to a paged absolute address 
formed by combining a 26-bit target with four bits of the 
Program counter (J-type format, for subroutine calls), or 
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OP 

Description 

OP 

LB 

Load/Store Instructions 

Load Byte 

SRA 

LBU 

Load Byte Unsigned 

SLLV 

LH 

Load Halfword 

SRLV 

LHU 

Load Halfword Unsigned 

SRAV 

LW 

Load Word 


LWL 

Load Word Left 


LWR 

Load Word Right 

CVT.S.fmt 

SB 

Store Byte 

CVT.D.fmt 

SH 

Store Halfword 

CVT.W.fmt 

SW 

Store Word 


SWL 

Store Word Left 

MULT 

MULTU 

SWR 

Store Word Right 


FPA Load/Store/Move Instructions 

DIV 

LWC1 

Load Word to FPA 

DIVU 

SWC1 

Store Word from FPA 

MFHI 

MTC1 

Move Word to FPA 

MTHI 

MFC1 

Move Word from FPA 

MFLO 

CTC1 

Move Control word to FPA 

MTLO 

CFC1 

Move Control word from FPA 



Arithmetic Instructions 

J 


(ALU Immediate) 

JAL 

ADDI 

Add Immediate 

JR 

ADDIU 

Add Immediate Unsigned 

JALR 

SLT1 

Set on Less Than Immediate 

BEQ 

SLTIU 

Set on Less Than Immediate 

BNE 


Unsigned 

BLEZ 

ANDI 

AND Immediate 

BGTZ 

ORI 

OR Immediate 

BLTZ 

XORI 

Exclusive OR Immediate 

BGEZ 

LUI 

Load Upper Immediate 



Arithmetic Instructions 

BLTZAL 

BGEZAL 

ADD 

(3-operand, register-type) 

Add 


ADDU 

Add Unsigned 

SYSCALL 

SUB 

Subtract 

SUBU 

Subtract Unsigned 

BREAK 

SLT 

Set on Less Than 


SLTU 

AND 

Set on Less Than Unsigned 

AND 

LWCZ 

OR 

OR 

swcz 

XOR 

Exclusive OR 

MTCZ 

NOR 

NOR 

MFCZ 


FPA Computational Instructions 

CTCZ 

CFCZ 

ADD.fmt 

Floating point Add 

COPZ 

SUB.fmt 

Floating point Subtract 

BCZT 

MUL.fmt 

Floating point Multiply 

BCZF 

DIV.fmt 

Floating point Divide 


ABS.fmt 

Floating-point Absolute value 


MOV.fmt 

Floating point Move 

MTCO 

MFCO 

NEG.fmt 

Floating point Negate 


FPA Compare Instructions 

TLBR 

C.cond.fmt 

Floating-point Compare 

TLBWI 


Shift Instructions 

TLBWR 

TLBP 

SLL 

Shift Left Logical 

RFE 

SRL 

Shift Right Logical 



Description 

Shift Instructions (Cont.) 

Shift Right Arithmetic 
Shift Left Logical Variable 
Shift Right Logical Variable 
Shift Right Arithmetic Variable 

FPA Conversion Instructions 

Floating point Convert to Single FP 
Floating point Convert to Double FP 
Floating point Convert to fixed point 

Multiply/Divide Instructions 
Multiply 

Multiply Unsigned 
Divide 

Divide Unsigned 
Move From HI 
Move To HI 
Move From LO 
Move To LO 

Jump and Branch Instructions 
Jump 

Jump and Link 

Jump to Register 

Jump and Link Register 

Branch on Equal 

Branch on Not Equal 

Branch on Less than or Equal to Zero 

Branch on Greater Than Zero 

Branch on Less Than Zero 

Branch on Greater than or 

Equal to Zero 

Branch on Less Than Zero and Link 
Branch on Greater than or Equal to 
Zero and Link 

Special Instructions 
System Call 
Break 

Coprocessor Instructions 

Load Word from Coprocessor 
Store Word to Coprocessor 
Move To Coprocessor 
Move From Coprocessor 
Move Control to Coprocessor 
Move Control From Coprocessor 
Coprocessor Operation 
Branch on Coprocessor z True 
Branch on Coprocessor z False 

System Control Coprocessor 
(CPO) Instructions 
Move To CPO 
Move From CPO 
Read indexed TLB entry 
Write Indexed TLB entry 
Write Random TLB entry 
Probe TLB for matching entry 
Restore From Exception 


Table 1. IDT79R3500A Instruction Summary 
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32-bit register byte addresses (R-type, for returns and 
dispatches). Branches have 16-bit offsets relative to the 
program counter (l-type). Jump and Link instructions save 
a return address in Register31 . The 79R3500A instruction 
set features a numberof branch conditions. Included is the 
ability to compare a register to zero and branch, and also 
the ability to branch based on a comparison between two 
registers. Thus, net performance is increased since soft- 
ware does not have to perform arithmetic instructions prior 
to the branch to set up the branch conditions. 

• Coprocessor instructions perform operations in the 
coprocessors. Coprocessor Loads and Stores are l-type. 

• Coprocessor 0 instructions perform operations on the 
System Control Coprocessor (CPO) registers to manipu- 
late the memory management and exception handling 
facilities of the processor. 

• Special instructions perform a variety of tasks, including 
movement of data between special and general registers, 
system calls, and breakpoint. They are always R-type. 

Table 1 lists the instruction set of the IDT79R3500A 
processor. 

IDT79R3500A System Control Coprocessor (CPO) 

The IDT79R3500A can operate with up to four tightly- 
coupled coprocessors (designated CPO through CP3). The 
System Control Coprocessor (or CPO), is incorporated on the 
IDT79R3500A chip and supports the virtual memory system 
and exception handling functions of the IDT79R3500A. The 
virtual memory system is implemented using a Translation 
Lookaside Buffer and a group of programmable registers as 
shown in Figure 5. 

System Control Coprocessor (CPO) Registers 

The CPO registers shown in Figure 5 are used to control 
the memory management and exception handling capabilities 
of the IDT79R3500A. Table 2 provides a brief description of 
each register. 


Register 

Description 

EntryHi 

High half of a TLB entry 

Entry Lo 

Low half of a TLB entry 

Index 

Programmable pointer into TLB array 

Random 

Pseudo-random pointer into TLB array 

Status 

Mode, interrupt enables, and diagnostic status info 

Cause 

Indicates nature of last exception 

EPC 

Exception Program Counter 

Context 

Pointer into kernel’s virtual Page Table Entry array 

BadVA 

Most recent bad virtual address 

PRId 

Processor revision identification (Read only) 
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Table 2. System Control Coprocessor (CPO) Registers 


SYSTEM COPROCESSOR 



| | Used with Virtual Memory System 

[~1 Used with Exception Processing 
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Figure 5. The System Coprocessor Registers 
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Memory Management System 

The IDT79R3500A has an addressing range of 4 Gbytes. 
However, since most 1DT79R3500A systems implement a 
physical memory smaller than 4Gbytes, the IDT79R3500A 
provides for the logical expansion of memory space by trans- 
lating addresses composed in a large virtual address space 
into available physical memory address. Two TLB modes are 
supported. When the TLB is used, the 4 GByte address space 
is divided into 2 GBytes which can be accessed by both the 
users and the kernel, and 2 GBytes for the kernel only. Virtual 
addresses within the kernel/user segment are translated to 
physical addresses on a 4kB page basis. This mode is typical 
of UNIX and other sophisticated operating systems. When the 
TLB is disabled, mapping is locked as 2 GBytes as kernel/ 
user, and 1.5 GBytes as kernel only. This mode requires no 
TLB manipulation, provides large linear address space, and is 
typical for embedded applications. 

TLB (Translation Lookaside Buffer) 

Virtual memory mapping is assisted by the Translation 
Lookaside Buffer (TLB). The on-chip TLB provides very fast 
virtual memory access and is well-matched to the require- 
ments of multi-tasking operating systems. The fully-associa- 
tive TLB contains 64 entries, each of which maps a 4-Kbyte 
page, with controls for read/write access, cacheability, and 
process identification. The TLB allows each userto access up 
to 2 Gbytes of virtual address space. 

Figure 6 illustrates the format of each TLB entry. The 


Translation operation involves matching the current Process 
I D (PID) and upper 20 bits of the address against PID and VPN 
(Virtual Page Number) fields in the TLB. When both match (or 
the TLB entry is Global), the VPN is replaced with the PFN 
(Physical Frame Number) to form the physical address. 

TLB misses are handled in software, with the entry to be re- 
placed determined by as imple RANDOM function. The rou- 
tine to process a TLB miss in the UNIX environment requires 
only 10-1 2 cycles, which compares favorably with many CPUs 
which perform the operation in hardware. 

TLB Disabled Operation 

Many embedded systems do not like the complexity or 
uncertainty associated with the on-chip TLB. However, many 
systems still desire the ability to implement a kernel/user 
mode. Therefore, to implement a hierachical task model, the 
TLB must be used. The IDTR3500A gives the system de- 
signer one more option, allowing the TLB to be disabled and 
performing a fixed mapping of virtual to physical addresses, 
while maintaining separation of kernel and user resources. 

The user may elect to disable the TLB through the reset 
sectors. In this case, the mapping shown in Figure 8. is used, 
and device power consumption is reduced. Note that 
“cached" segments means that there is no mechanism to 
exclude addresses in these regions from the cache. 

This mapping means that applications designed to run in 
ksegO and ksegl (to avoid the TLB) can use the IDT3500A, 
disable the TLB to reduce power, and not have to change 
software to take advantage of this new feature. 


TLB ENTRY FORMAT 


63 

44 

43 38 

37 32 

31 12 

11 

10 

9 

8 

7 


0 

VPN 

TLBPID 

O 

PFN 

N 

D 

V 

G 

O 

v 


Y 

A 











J 


ENTRYHI ENTRYLO 


VPN - Virtual Page Number 
TLBPID - Process ID 
PFN - Physical Frame Number 
N - Non-cacheable flag 
D - Dirty flag (Write protect) 

V- Valid entry flag 
G - Global flag (ignore PID) 

O - Reserved 


Figure 6. TLB Entry Format 
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MMU ADDRESS TRANSLATION 
VIRTUAL PHYSICAL 


OxFFFFFFFF 

KERNEL 

MAPPED 

ANY > 

0x00000000 

CACHEABLE 

(kseg2) 

— V 

OxAOOOOOOO 

KERNEL 

UNMAPPED 

UNCACHED 

(ksegl) 


0x80000000 

KERNEL 

UNMAPPED 

CACHED 

(ksegO) 


0x7FFFFFFF 

KERNEL/USER 

MAPPED 

ANY > 

0 

CACHEABLE 

(kuseg) 

— V 


OxFFFFFFFF 

*\ 


PHYSICAL 

MEMORY 


>- 3584 MB 

0x20000000 



0x1 FFFFFFF 


MEMORY 


>- 512 MB 

0x00000000 
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Figure 7. IDT79R3500A Virtual Address Mapping 
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2048 MB 
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Figure 8. TBL Disabled Mapping 
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NOTE: This model is consistent with the mapping available in the IDT79R3051 family. The identical mapping provides software compatibility to the 
lower cost CPUs. 


5.4 


7 








IDT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


Operating Modes 

The IDT79R3500A has two operating modes: User mode 
and Kernel mode. The 1DT79R3500A normally operates in the 
User mode until an exception is detected forcing it into the 
Kernel mode. It remains in the Kernel mode until a Restore 
From Exception (RFE) instruction is executed. The manner in 
which memory addresses are translated or mapped depends 
on the operating mode of the IDT79R3500A. Figure 7 shows 
the MMU translation performed for each of the operating 
modes. 

User Mode — in this mode, a single, uniform virtual address 
space (kuseg) of 2 Gbyte is available. When the TLB is used, 
each virtual address is extended with a 6-bit process identifier 
field to form unique virtual addresses. All references to this 
segment are mapped through the TLB. Use of the cache for 
up to 64 processes is determined by bit settings for each page 
within the TLB entries. If the TLB is not used, these addresses 
are translated to begin at 1Gbyte of the physical address 
space. 

Kernel Mode — four separate segments are defined in this 
mode: 

• kuseg — when in the kernel mode, references to this seg- 
ment are treated just like user mode references, thus 
streamlining kernel access to user data. 

• ksegO— references to this 512 Mbyte segment use cache 
memory but are not mapped through the TLB. Instead, they 
always map to the first 0.5 GBytes of physical address 
space. 

• ksegl — references to this 512 Mbyte segment are not 
mapped through the TLB and do not use the cache. 
Instead, they are hard-mapped into the same 0.5 GByte 
segment of physical address space as ksegO. 

• kseg2—' when the TLB is not used, references to this 
1Gbyte segment directly addresses the upper 1Gbyte of 
physical address space. These addresses are defined to 
be kernel mode which are cacheable. When the TLB is 
used, references to this 1 Gbyte segment are always mapped 
through the TLB and use of the cache is determined by bit 
settings within the TLB entry. 

FPA COPROCESSOR OPERATION (CPI) 

The FPA continually monitors the processor instruction 
stream. If an instruction does not apply to the coprocessor, it 
is ignored; if an instruction does apply to the coprocessor, the 
FPA executes that instruction and transfers necessary result 
and exception data synchronously to the main processor. 
The FPA performs three types of operations: 

• Loads and Stores; 

• Moves; 

• Two- and three-register floating-point operations. 


Load, Store, and Move Operation 

Load, Store, and Move operations data between memory 
or the integer registers and the FPA registers. These opera- 
tions perform no format conversions and cause no floating- 
point exceptions. Load, Store, and Move operations reference 
a single 32-bit word of either the Floating-Point General 
Registers (FGR) orthe Floating-Point Control Registers (FCR). 


Floating-Point Operations 

The FPA supports the following single- and double-preci- 
sion format floating-point operations: 

• Add 


• Subtract 

• Multiply 

• Divide 

• Absolute Value 

• Move 

• Negate 

• Compare 

In addition, the FPA supports conversions between single- 
and double-precision floating-point formats and fixed-point 
formats. 

The FPA incorporates separate Add/Subtract, Multiply, 
and Divide units, each capable of independent and concurrent 
operation. Thus, to achieve very high performance, floating 
point divides can be overlapped with floating point multiplies 
and floating point additions. These floating point operations 
occur independently of the actions of the CPU, allowing 
further overlap of integer and floating point operations. Figure 
9 illustrates an example of the types of overlap permissible. 



Exceptions 

The FPA supports all five IEEE standard exceptions: 

• Invalid Operation 

• Inexact Operation 

• Division by Zero 

• Overflow 

• Underflow 

The FPA also suppoerts the optional, Unimplemented 
Operation exception that allows unimplemented instructions 
to trap to software emulation routines. 

The FPA provides precise exception capability to the CPU; 
that is, the execution of a floating point operation which 
generates an exception causes that exception to occur at the 
CPU instruction which caused the operation. This precise 
exception capability is a requirement in applications and 
languages which provide a mechanism for local software 
exception handlers within software modules. 
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DIV.S 


MUL.S 


STORE (SWC1) 


NON FPU 


Only Load, Store, and Move operations 
are permitted in FPA during these cycles. 

Other FPA instructions can proceed during 
these cycles. However, two multiply or two 
divide operation cannot be overlapped. 

These cycles are free for integer operations 
in the CPU. 



ADD.S 

STORE (SWC1) 

LOAD (LWC1) 
STORE (SWC1) 


NON FPU 


Figure 9. Examples of Overlapping Floating Point Operation 


IF 

RD 

ALU 

MEM 

WB 

FWB 

1-Cache 

RF 

OP 

D-Cache 

Register file 
write back or 
FP exceptions 

— 

*FpWB 

1 1 





One Cycle 

Figure 10. Instruction Execution 


* FP ops only 

2871 drw 10 



RD 

ALU 

MEM 

WB 

*FWB 






IF 

RD 

ALU 

MEM 

WB 

*FWB 






IF 

RD 

ALU 

MEM 

WB 

*FWB 





IF ' 

RD 

ALU 

MEM 

WB 

*FWB 


Instruction 

Flow 



IF 

RD 

ALU 

MEM 

WB 









RD 

ALU 

MEM 

WB 

*FWB 


Current 

CPU 

Cycle 


2871 drw 11 


Figure 11. IDT79R3500A Execution Sequence 
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IDT79R3500 PIPELINE ARCHITECTURE 

The execution of a single IDT79R3500A integer instruction 
consists of five pipe stages while floating point instruction 
takes six pipe stages. They are: 

1) IF — Instruction fetch. The processorcalculates the instruc- 
tion address required to read from the I cache. 

2) RD— The instruction is present on the data bus during 
phase one of this pipe stage. Instruction decode occurs 
during phase two. Operands are read from the registers if 
required. 

3 ALU — Perform the required operation on instruction oper- 
ands. If this is a FPA instruction, instruction execution 
commences. 

4) MEM — Access memory. If the instruction is a load or store, 
the data is presented or captured during phase 2 of this 
pipe stage. 

5) WB — Write integer results back into register file. In FPA 
cycles this pipe stage is used for exceptions. 

6) FWB— The FPA uses this stage to write back ALU results 
to its register file. 

Each of these steps requires approximately one FPA cycle 
as shown in Figure 1 0. (parts of some operations spill over into 
another cycle while other operations require only 1/2 cycle.) 

The CPU uses a five stage pipeline while while the FPA 
uses a 6 stage to achieve an instruction execution rate 
approaching one instruction per cycle. Thus, execution of six 
instructions at a time are overlapped as shown in Figure 1 1 . 

This pipeline operates efficiently because different CPU 
resources (address and data bus accesses, ALU operations, 
register accesses, and so on) are utilized on a non-interfering 
basis. 

MEMORY SYSTEM HIERARCHY 

The high performance capabilities of the IDT79R3500A 
processor demand system configurations incorporating tech- 
niques frequently employed in large, mainframe computers 
but seldom encountered in systems based on more traditional 
microprocessors. 

A primary goal of systems employing RISC techniques is to 
minimize the average number of cycles each instruction 
requires for execution. Techniques to reduce cycles-per- 
instruction include a compact and uniform instruction set, a 
deep instruction pipeline (as described above), and utilization 
of optimizing compilers. Many of the advantages obtained 
from these techniques can, however, be negated by an 
inefficient memory system. 

Figure 12 illustrates memory in a simple microprocessor 
system. In this system, the CPU outputs addresses to memory 
and reads instructions and data from memory or writes data to 
memory. The address space is completely undifferentiated: 
instructions, data, and I/O devices are all treated the same. In 
such a system, a primary limiting performance factor is 
memory bandwidth. 
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Figure 12. A Simple Microprocessor Memory System 


Figure 13 illustrates a memory system that supports the 
significantly greater memory bandwidth required to take full 
advantage of the IDT79R3500A’s performance capabilities. 
The key features of this system are: 
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Figure 13. An IDT79R3500A System with a 
High-Performance Memory System 
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• External Cache Memory— Local, high-speed memory 
(called cache memory) is used to hold instructions and data 
that is repetitively accessed by the CPU (for example, 
within a program loop) and thus reduces the number of 
references that must be made to the slower-speed main 
memory. Some microprocessors provide a limited amount 
of cache memory on the CPU chip itself. The external 
caches supported by the IDT79R3500A can be much 
larger; while a small cache can improve performance of 
some programs, significant improvements for a wide range 
of programs require large caches. 

• Separate Caches for Data and Instructions— Even with 
high-speed caches, memory speed can still be a limiting 
factor because of the fast cycle time of a high-performance 
microprocessor. The IDT79R3500A supports separate 
caches for instructions and data and alternates accesses 
of the two caches during each CPU cycle. Thus, the 
processor can obtain data and instructions at the cycle rate 
of the CPU using caches constructed with commercially 
available IDT static RAM devices. 

Inorderto maximize bandwidth in the cache while minimiz- 
ing the requirement for SRAM access speed, the R3500A 
divides a single-processor clock cycle into two phases. 
During one phase, the address for the data cache access 
is presented while data previously addressed in the in- 
struction cache is read; during the next phase, the data 
operation is completed while the instruction cache is being 
addressed. Thus, both caches are read in a single proces- 
sor cycle using only one set of address and data pins. 

• Write Buffer— inorderto ensure data consistency, all data 
that is written to the data cache must also be written out to 
main memory. The cache write model used by the 
IDT79R3500A is that of a write-through cache; that is, all 
data written by the CPU is immediately written into the main 
memory. To relieve the CPU of this responsibility (and the 
inherent performance burden) the IDT79R3500A supports 
an interface to a write buffer. The I DT79R3020 Write Buffer 
captures data (and associated addresses) output by the 
CPU and ensures that the data is passed on to main 
memory. 

IDT79R3500A Processor Subsystem Interfaces 

Figure 14 illustrates the three subsystem interfaces pro- 
vided by the IDT79R3500A processor; 

• Cache control interface (on-chip) for separate data and 
instruction caches permits implementation of off-chip caches 
using standard IDT SRAM devices. The 79R3500A directly 
controls the cache memory with a minimum of external 
components. Both the instruction and data cache can vary 
from 0 to 256K Bytes (64K entries). The 79R3500A also 
includes the TAG control logic which determines whether 
or not the entry read from the cache is the desired data. The 
79R3500A cache controller implements a direct mapped 
cache for high net performance (bandwidth). It has the 


ability to refill multiple words when a cache miss occurs, 
thus reducing the effective miss rate to less than 2% for 
large caches. When a cache miss occurs, the 79R3500A 
can support refilling the cache in 1, 4, 8, 16, or 32 word 
blocks to minimize the effective penalty of having to access 
main memory. The 79R3500A also incorporates the ability 
to perform instruction streaming; while the cache is refill- 
ing, the processor can resume execution once the missed 
word is obtained from main memory. In this way, the 
processor can continue to execute concurrently with the 
cache block refill. 

• Memory controller interface for system (main) memory. 
This interface also includes the logic and signals to allow 
operation with a write buffer to further improve memory 
bandwidth. In addition to the standard full word access, the 
memory controller supports the ability to write bytes and 
half-words by using partial word operations. The memory 
controller also supports the ability to retry memory ac- 
cesses if, for example, the data returned from memory is 
invalid and a bus error needs to be signalled. 

• Coprocessor Interface — The IDT79R3500 features a set 
of on board tightly coupled coprocessors. Coprocessor 0 is 
defined to be the system control coprocessor and 
Coprocessor 1 is the Floating Point Accelerator. They have 
direct access to the internal data bus which allows them 
direct load and store of data in the same fashion as 
accessing the CPU registers. This relieves the typical 
bottleneck of having to load data into the CPU register set 
and then passing that data off to the co-processors. 

In applications where the FPA was off chip, as in using the 
IDT79R3010A several control pins were used for commu- 
nications with the CPU and a Phase Lock Loopwas located 
on the IDT79R3010A to synchronize the two together. As 
they are now integrated into a single chip, these are no 
longer needed. The FpCond output, which is used in 
coprocessor branch instructions, is now internally tied to 
the CpCond(l) input of the CPU leaving the external 
CpCond(l) pin available for another function. This signal is 
selectable to either output the FpBusy or the FPInt. For 
applications where FPInt was connected to any one of the 
six CPU HW interrupt inputs, that can also be internally 
routed-the default being lnt(3), as recommended by the 
MIPS architecture. If FPInt is internally routed, the external 
interrupt input corresponding to the FP interrupt is ignored. 
Internal routing of these selections are made via the reset 
vector. 

The internal CPBusy input, which is used to stall the CPU 
if the coprocessor needs to hold off subsequent opera- 
tions, has two sources-FPBusy and the external CpBusy 
pin which are logically ORed together. Further, Run and 
Exception of both the FPA and CPU are internally tied and 
brought out with the external CPBusy input to accommo- 
date off chip coprocessor 2 and 3. This external interface 
is available to support application specific functions. 
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MULTIPROCESSING SUPPORT 

The IDT79R3500A supports multiprocessing applications 
in a simple but effective way. Multiprocessing applications 
require cache coherency across the multiple processors. The 
IDT79R3500A offers two signals to support cache coherency: 
the first, MPStall, stalls the processor within two cycles of 
being received and keeps it from accessing the cache. This 
allows an external agent to snoop into the processor data 
cache. The second signal, MPInvalidate, causes the proces- 
sor to write data on the data cache bus which indicates the 
externally addressed cache entry is invalid. Thus, a subse- 
quent access to that location would result in a cache miss, and 
the data would be obtained from main memory. 

The two MP signals would be generated by a external logic 
which utilizes a secondary cache to perform bus snooping 
functions. The 79R3500A does not impose an architecture for 
this secondary cache, but rather is flexible enough to support 
a variety of application specific architectures and still maintain 
cache coherency. Further, there is no impact on designs 
which do not require this feature. The 79R3500A further 
allows the use of cache RAMs with internal address latches 
in multiprocessor systems. 

ADVANCED FEATURES 

The IDT79R3500A offers a number of additional features 
such as the ability to swap the instruction and data caches, 
facilitating diagnostics and cache flushing. Another feature 
isolates the, caches, which forces cache hits to occur regard- 
less of the contents of the tag fields. The I DT79R3500 A allows 
the processor to execute user tasks of the opposite byte 
ordering (endianness) of the operating system, has double the 
integer multiply/divide performance of R3000 and R2000, has 
a programmable Tag width bus, and further allows parity 
checking to be disabled. More details on these features can be 
found in the IDT79R3500 Family Hardware User’s Manual. 

Further features of the IDT79R3500A are configured dur- 
ing the last four cycles prior to the negation of the RESET 
input. These functions include the ability to select cache sizes 
and cache refill block sizes; the ability to utilize the multipro- 
cessor interface; whether or not instruction streaming is 
enabled; whether byte ordering follows “Big-Endian" or “Little- 
Endian" protocols, etc. T able 3 shows the configuration options 
selected at Reset. These are further discussed in the “Hard- 
ware User’s Manual”. 

BACKWARD COMPATIBILITY 

The primary goal of the 79R3500A is the ability to replace 
the R3000 and R301 0 with a single chip solution. This can be 
doneeitherthe R3000/R3010orthe R3000A/R3010Aaswell. 
The pinout of the IDT79R3500A has been selected to ensure 
this compatibility, with new functions mapped onto previously 
used pins. The instruction set is compatible with that of the 
R2000 at the binary level. As a result, code written forthe older 
processor can be executed. 


In most R3000A applications, the IDT79R3500 can be 
placed in the socket with no modification to initialization 
settings. Additionally, the IDT79R3500 can be used in sys- 
tems that did not include the R3010 in the original design. 
Further application assistance on these topics are available 
from IDT. 

PACKAGE THERMAL SPECIFICATIONS 

The IDT79R350QA utilizes special packaging techniques 
to improve both the thermal and electrical characteristics of 
the microprocessor. 

In order to improve the electrical characteristics of the 
device, the package is constructed using multiple signal 
planes, including individual power planes and ground planes 
to reduce noise associated with high-frequency TTL parts. In 
addition, the 175-pin PGA package utilizes extra power and 
ground pins to reduce the inductance from the internal power 
planes to the power planes of the PC Board. 

In order to improve the electrical characteristics of the 
microprocessor, the device is housed using cavity down 
packaging. In addition, these packages incorporate a copper- 
tungsten thermal slug designed to efficiently transfer heat 
from the die to the case of the package, and thus effectively 
lower the thermal resistance of the package. The use of an 
additional external heat sink affixed to the package thermal 
slug further decreases the effective thermal resistance of the 
package. 

The case temperature may be measured in any environ- 
ment to determine whether the device is within the specified 
operating range. The case temperature should be measured 
at the center of the top surface opposite the package cavity 
(the package cavity is the side where the package lid is 
mounted). 

The equivalent allowable ambient temperature, Ta, can be 
calculated using the thermal resistance from case to ambient 
(0ca) for the given package. The following equation relates 
ambient and case temperature: 

TA = Tc - P*0ca 

where P is the maximum powerconsumption, calculated by 
using the maximum Icc from the DC Electrical Characteristics 
section. 

Typical values for 0ca at various airflows are shown in 
table 2 for the various CPU packages. 



Airflow - (ft/min) 

0 

200 

400 

600 

800 

1000 

0ca (175-PGA, 
144-PGA) 

21 

7 

3 

2 

1 

0.5 

0ca (172 Quad 
Flatpack) 

23 

9 

4 

3 

2.5 

1.5 


2871 tbl 03 


Table 2. R3500A Package Characteristics 
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Input 

W Cycle 

X Cycle 

Y Cycle 

Z Cycle 

Into 

DBIkSizeO 

DBIkSizel 

Extend Cache 

Big Endian 

Inti 

IBIkSizeO 

IBIkSizel 

MPAdrDisable 

TriState 

Int2 

DispPar/RevEnd 

IStream 

IgnoreParity 

NoCache 

Int3 

Reserved^ 1 ) 

StorePartial 

Multiprocessor 

BusDriveOn 

Int4 

FPINT decode 

FPINT decode 

FPINT decode 

FPINT onto CpCond 

IntS 

7RR3500 mode 

TLB disable 

Tag Mode 1 

Tag Mode 0 


NOTES: 

1 . Reserved entries must be driven high. 

2. These values must be driven stable throughout the entire RESET period. 


Table 3. R3500A Mode Selectable Features 
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Figure 14. IDT79R3500A Subsystem Interfaces Example; 64 KB Caches 
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PIN CONFIGURATION 
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T 72*Pin Flatpack(Top View) 


NOTES: 

1 . Reserved pins must be connected. 

2. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 17: MP Stall, CpCond (3). 
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PIN CONFIGURATION 
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NOTE: 

1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 17: MP Stall, CpCond (3). 
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PIN CONFIGURATION 
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144-Pin PGA (Top View) 

NOTE: 

1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 17: MP Stall, CpCond (3). 
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PIN DESCRIPTIONS 


Pin Name 


Description 

Data (0-31) 

n 

A 32-bit bus used for all instruction and data transmissbn among the processor, caches, memory interface, and 
coprocessors. 

DataP (0-3) 


A 4-bit bus containing even parity over the data bus. 

Tag (12-31) 


A 20-bit bus used fortransferring cache tags and high addresses between the processor, caches, and memory interface. 

TagV 


The tag validity indicator. 

Tag P (0-2) 

ca 

A 3-bit bus containing even parity over the concatenation of TagV and Tag. 

AdrLo (0-17) 

o 

An 1 8-bit bus containing byte addresses used for transferring bw addresses from the processorto the caches and memory 
interface. (AdrLo 16: CpCond (2), AdrLo 17: CpCond (3) set by reset initializatbn). 

TRdT 

El 

Read enable for the instrudbn cache. 

IWrT 

o 

Write enable for the instrudon cache. 

IRd2 

o 

An identical copy of IRdl used to split the bad. 

IWr2 

o 

An identical copy of IWrl used to split the bad. 

ICIk 

o 

The instrudion cache address latch dock. This clock runs continuously. 

DRdT 

o 

The read enable for the data cache. 

DWrT 

o 

The write enable for the data cache. 

DRd2 

o 

An identical copy of DRdl used to split the load. 

DWr2 

El 

An identical copy of DWrl used to split the load. 

DCIk 

o 

The data cache address latch clock. This clock runs continuously. 

XEn 

o 

The read enable for the Read Buffer. 

AccTyp(0-2) 

o 

A 3-bit bus used to indbate the size of data being transferred on the data bus, whether or not a data transfer is occurring, 
and the purpose of the transfer. 

MemWr 

o 

Signals the occurrence of a main memory write. 

MemRd 

o 

Signals the occurrence of a main memory read. 

BusError 

1 

Signals the occurrence of a bus error during a main memory read or write. 

Run 

o 

Indbates whether the processor is in the RUN or STALL state. In the discrete design, the R3000 Run output is tied directly 
to the R301 0 Run input. In the 79R3500, this is done internally, but the Run signal is also brought out for applbation specific 
coprocessors. 

Exception 

o 

Indbates that the instructbn that is about to commit to a state change should be aborted; also indicates other exception 
related information. In the discrete design, the R3000 Exception output is tied to the R301 0 Exception input. In the 79R3500 
this is done internally, but the Exception signal is also brought out for application specific coprocessors. 

CpSync 

o 

A clock whbh is identical to SysOut and used by external coprocessors for timing synchronizatbn with the 79R3500. In 
the discrete design, CpSync output from the R3000 is tied to the R3010 FPSync input. In the 79R3500, this is done 
internally, but the CpSync signal is also brought out for applbation specific coprocessors. 

RdBusy 

1 

The main memory read stall terminatbn signal. In most system designs RdBusy is normally asserted and is deasserted 
only to indbate the successful completbn of a memory read. RdBusy is sampled by the processor only during memory 
read stalls. 

WrBusy 

' 

The main memory write stall initiationAermination signal. 

CpBusy 

1 

Input used to indicate that the requested coprocessor resource is unavailable, or used to preserve the precise exception 
model. In the descrete design, CpBusy is driven directly by the R301 0 FpBusy output. In the 79R3500 the CpBusy input 
of the CPU is the bgbal OR of both the internal FPA FpBusy and the external CpBusy pin. This input is provided for 
external applbation specific coprocessors. An internal pull down resistor is provided if this input is left open. 

CpCond(1 ) 

1 

Signal used by the branch on Coprocessor 1 trueAalse instructbn. Indiscrete systems using a R301 0 FPA, this is normally 
tied to the FpCond output. In the 79R3500, the internal FpCond is directly tied to the internal CpCond(1 ) input leaving this 
pin available for other functbns.This pin defaults to output the FpBusy internal signal or, (via the Reset vectors), output the 
FPInt — in the latter case, external hardware must route this signal to the appropriate Int pin. 

CpCond (0,2-3) 

1 

Conditbnal branch status from coprocessors to the processor. Functbn is provided on AdrLo 1 6/1 7 pins and is selected 
at reset time. 

MPStall 

1 

Multiprocessing Stall. Signals to the processor that it should stall accessestothecaches in a multiprocessing environment. 
This is physically the same pin as CpCond3; its use is determined at RESET initialization. 

MPInvalidate 

1 

Multiprocessing Invalidate. Signals to the processor that it should issue invalidate data on the cache data bus. The address 
to be invalidated is externally provided. This is the same pin as CpCond2; its use is determined at RESET initializatbn. 


Int (0-5) 


A 6-bit bus used by the memory interface and coprocessors to signal maskable interrupts to the 79 R3500. This bus is also 
used at reset time to select among the mode-selectable features of the 79R3500. The FPA FPInt output signal is typically 
connected tooneof these interrupt lines;the choice is programmablethrough the reset vectors with thedefault being lnt(3). 






































































IDT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN DESCRIPTIONS (Continued) 


Pin Name 


Description 

Clk2xSys 

1 

The master double frequency input clock used for generating SysOut. 

Clk2xSmp 

1 

A double frequency clock input used to determine the sample point for data coming into the processor and 
coprocessors. 

Clk2xRd 

1 

A double frequency clock input used to determine the enable time of the cache RAMs. 

Clk2xPhi 

1 

A double frequency clock input used to determine the position of the internal phases, phasel and phase2. 

Reset 

■ 

Synchronous initialization input used to force execution starting from the reset memory address. Reset must be 
deasseted synchronously but asserted asynchronously. The deassertion of Reset must be synchronized by the 
leading edge of SysOut. 
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ABSOLUTE MAXIMUM RATINGS 0 ’ 3) 


EHS50 

Rating 

Commercial 

Military 

Unit 

Vterm 

Terminal Voltage 
with Respect 
toGND 

-0.5 to +7.0 

-0.5 to +7.0 

V 

Ta.Tc 

Operating 

Temperature 

0 to +70< 4 > 
(Ambient) 

0 to +90 (5 > 
(Case) 

-55 to +125 
(Case) 

1 

Tbias 

Case Temperature 
Under Bias 

-55 to +125 (4) 
0 to +90< 5 > 

—65 to +135 

■ 

Tstg 

Storage 

Temperature 

-55 to +125 

-65 to +155 

°c 

IlN 

Input Voltage 

-0.5 to +7.0 

-0.5 to +7.0 

V 
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RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


Grade 

Temperature 

GND 

Vcc 

Military 

16-33 MHZ 

-55°C to +125°C 
(Case) 

OV 

5.0 ±10% 

Commercial 

1 6-33 MHz 

0°C to +70°C 
(Ambient) 

ov 

5.0 ±5% 

Commercial 
37-40 MHz 

0°C to +90°C 
(Case) 

OV 

5.0 ±5% 


OUTPUT LOADING FOR AC TESTING 



NOTE: 

1 . Stresses greater than those listed u nder ABSOLUTE MAXI MU M RATI NGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. Vin minimum = -3.0V for pulse width less than 15ns. 

Vin should not exceed Vcc +0.5 Volts. 

3. Notmorethanoneoutputshouldbeshortedatatime. Duration of the short 
should not exceed 30 seconds. 

4. 16-33 MHz only. 

5. 37-40 MHz only. 


AC TEST CONDITIONS 



Symbol 

Parameter 

Min. 

Max. 

Unit 

VlH 

input HIGH Voltage 

3.0 

— 

V 

VlL 

Input LOW Voltage 

— 

0.4 

V 

VlHS 

Input HIGH Voltage 

3.5 

— 

V 

VlLS 

Input LOW Voltage 

, — 

0.4 

V 
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IDT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS— 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ±5%) 


Symbol 

Parameter 

Test Conditions 

79R3500A 

79R3500AE 

Unit 

16.67MHz 

20.0MHz 

25.0MHz 

33.33MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

— 

3.5 

— 

D 

VOL 

Output LOW Voltage 

Vcc = Min., Iol = 4mA 

— 

0.4 

— 

0.4 

— 

0.4 

— it 

|0.4 

D 

Vohc 

Output HIGH Voltage* 7 ) 

Vcc = Min., Ioh = -4mA 

4.0 

— 

4.0 

— 

4.0 

— . 

4.0 

. — 

D 

Voht 

Output HIGH Voltage* 4 - 6 ) 

Vcc = Min., Ioh = -8mA 

m 

— 

El 

— 

El 

SSI* 

m 

— 

V 

Volt 

Output LOW Voltage* 4 - 6 ) 

Vcc = Min., Iol = 8mA 

— 

0.8 

— 

0.8 

— 

0.8 

w — 

0.8 

V 

Vih 

Input HIGH Voltage* 5 ) 


2.0 

— 

2.0 

— 

2.0 


2.0 

— 

V 

Vil 

Input LOW Voltage* 1 ) 


— 

0.8 

— 

0.8 

:: — ;• 

0.8 

— 

0.8 

D 

Vihs 

Input HIGH Voltage* 2 ’ 5 ) 


3.0 

— 

3.0 

— 

3.0 

— 

ill 

— 

D 

Vils 

Input LOW Voltage* 1 ' 2 ) 


— 

0.4 

— 

0.4 

— 

0.4 

— 

0.4 

V 

ClN 

Input Capacitance* 6 ) 


— 

10 


TO 

— 

10 

— 

10 

PF 

Cout 

Output Capacitance* 6 ) 


— 

10 

— 

10 

— 

10 

— 

10 

PF 

Icc 

Operating Current 

Vcc = 5V, Ta = 70°C 

— 

450 


550 

— 

650 

— 

750 


IlH 

Input HIGH Leakage* 3 ) 

Vih = VCC 


: 100; 

— 

100 

— 

100 

— 

100 

pA 

IlL 

Input LOW Leakage* 3 ) 

Vil = GND 



-100 

— 

-100 

— 

-100 

— 

pA 

loz 

Output Tri-state Leakage 

Voh = VCC, VOL = GND 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

PA 


2871 tbl 09 


NOTES: 

1. Vn Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vus apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag busses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 
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IDT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS— 


MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V±10%) 


Symbol 

Parameter 

Test Conditions 

79R3500A 

79R3500AE 

Unit 

16.67MHz 

20.0MHz 

25.0MHz 

33.33MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

— 

3.5 

— 

0! 

VOL 

Output LOW Voltage 

Vcc = Min., Iol = 4mA 

— 

0.4 

— 

0.4 

— 

0.4 

Pit 

|,0.4 

0 

Vohc 

Output HIGH Voltage* 7 ' 

Vcc = Min., Ioh = -4mA 

4.0 

— 

4.0 

— 

4.0 

— 

4.0 

— 

o 

Voht 

Output HIGH Voltage* 4 ' 6 ) 

Vcc = Min., Ioh = -8mA 

ESI 

— 

El 

— 

El 

•!&,' 


— 

V 

Volt 

Output LOW Voltage* 4 ’ 6 ) 

Vcc = Min., Iol = 8mA 

— 

0.8 

— 

0.8 

— •if 

0.8 

— 

0.8 

V 

Vih 

Input HIGH Voltage* 5 ) 


2.0 

— 

2.0 

— 

2.0 

IP- 

2.0 

— 

V 

Vil 

Input LOW Voltage* 1 ) 


— 

0.8 

— 

0.8|! 

— 

0.8 

— 

0.8 

a 

Vihs 

Input HIGH Voltage* 2 ' 5 ) 


3.0 

— 

El 

,ip| 

3.0 

— 

3.0 

— 

d 

Vils 

Input LOW Voltage* 1 ' 2 ) 


— 

ESI 

— 


— 

0.4 

— 

0.4 

0 

Cin 

Input Capacitance* 6 ) 


— 

10 


#10 

— 

10 

— 

10 

0, 

Cout 

Output Capacitance* 6 ) 


— 

ioi 

IIP' 1 

10 

— 

10 

— 

10 

m 

Icc 

Operating Current 

Vcc = 5V, Ta = 70°C 

— 

mm 

> — 

600 

— 

650 

— 

750 


IlH 

Input HIGH Leakage* 3 ) 

Vih = VCC 

iff 

§t oo 

— 

100 

— 

100 

— 

100 

pA 

IlL 

Input LOW Leakage* 3 ) 

Vil = GND 

-1 00? 

1 — 

-100 

— 

-100 

— 

-100 

— 

pA 

loz 

Output Tri-state Leakage 

Voh = VCC, Vol = GND 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

pA 
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NOTES: 

1. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag busses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not bo held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 
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IDT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS— 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V ±5%) 


79R3500AE 


Test Conditions 


Vcc = Min., Ioh = -4mA 


Vcc = Min., Iol = 4mA 


Vcc = Min., Ioh = -4mA 


Vcc ■ Min., Ioh = -8mA 


Vcc = Min., Iol = 8mA 


Symbol 

Parameter 

Voh 

Output HIGH Voltage 

Vol 

Output LOW Voltage 

Vohc 

Output HIGH Voltage* 7 ) 

Voht 

Output HIGH Voltage* 4 ’ 6 ) 

Volt 

Output LOW Voltage* 4 ' 6 ) 

Vih 

Input HIGH Voltage* 5 ) 

Vil 

Input LOW Voltage* 1 ) 

Vihs 

Input HIGH Voltage* 2 ' 5 ) 

Vils 

Input LOW Voltage* 1 ' 2 ) 

Cin 

Input Capacitance* 6 ) 

Cout 

Output Capacitance* 6 ) 

Icc 

Operating Current 

IlH 

Input HIGH Leakage* 3 ) 

IlL 

Input LOW Leakage* 3 ) 

loz 

Output Tri-state Leakage 




Vcc = 5V, Ta = 70°C 


Vih = VCC 


Vil = GND 


Voh = VCC, Vol = GND 



1. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 V olts for larger periods. 

2. Vihs and Vils apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. Voht and Volt apply to the bidirectional data and tag busses only. Note that Vih and Vil also apply to these signals. Voht and Volt are provided 
to give the designer further information about these specific signals. 

5. Vih should not be held above Vcc + 0.5 volts. 

6. Guaranteed by design . 

7. Vohc applies to RUN and Exception. 
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1DT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS* 1 - 2 ’ 3 >— 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°Cto +70°C, Vcc = +5.0V±5%) 


Symbol Parameter 


Clock 


TckHigh Input Clock High* 2 * 


TckLow I Input Clock Low! 2 * 


TckP 


Run Operation 


TDEn Data Enable* 3 ) 


Tddis Data Disable* 3 ) 


TDVal Data Valid 


TwrDiy Write Delay 


Tds Data Set-up 


Tdh Data Hold* 3 ) 


Tens CpBusy Set-up 


Tcbh CpBusy Hold 


TAdy Access Type (1:0) 


TaT 2 Access Type (2) 


TMWr Memory Write 


TExc Exception 


TAval Address Valid 


Tints lnt(n) Set-up 


TintH lnt(n) Hold 


Stall Operation 


Reset Initialization 


79R3500A 


16.67MHz I 20.0MHz 


Test Conditions Min. Max. 


79R3500AE 

25.0 MHz I 33,33MHz 



TSAVal 

Address Valid 

Load= 25pF 

TSAcTy 

Access Type 

Load= 25pF 

TMRdi 

Memory Read Initiate 

Load= 25pF 

TMRdt 

Memory Read Terminate 

Load= 25pF 

Tstl 

Run Terminate 

Load= 25pF 

TRun 

Run Initiate 

Load= 25pF 

TSMWr 

Memory Write 

Load= 25pF 

TSExc 

Exception Valid 

Load= 25pF 


Trst 

Reset Pulse Width 

TrstPLL 

Reset timing. Phase-lock on* 4 ' 5 * 

Trstcp 

Reset timing, Phase-lock off* 4 ' 5 ) 




EBBIBIEiEBH9iSI 





1 

0 

1 


Capacitive Load Deration 


CLD Load Derate* 6 * 


287 ' 

NOTES: 

1 . All timings are referenced to 1 ,5V. 

2. The clock parameters apply to all four 2xC!ocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters apply when the 79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 
for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cy cle (tw o cycles of a 2x clock). 

6. Wth the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns for 33.33MHz; clock transition time < 5ns for other speeds. 
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1DT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS * 1 > 2 > 3 >— 

MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V±10%) 


79R3500A 


79R3500AE 


Symbol Parameter 


Clock 


TckHigh Input Clock High( 2 > 


TckLow | Input Clock Low< 2 ) 


Test Conditions 


16.67MHz 


Mln. Max. 


20.0MHz 




25.0MHz 






Run Operation 


TDEn Data Enabled 


Tddis Data Disable^ 3 ) 


TDVal Data Valid 


TwrOiy | Write Delay 


Data Set-up 


Data Hold( 3) 


CpBusy Set-up 


CpBusy Hold 


Access Type (1:0) 


Access Type (2) 


Memory Write 


Exception 


Address Valid 


lnt(n) Set-up 


lnt(n) Hold 


Stall Operation 



Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


TSAVal 

Address Valid 

Load= 25pF 

- 

30 


23 

rn 

20 

— 

mm 

ns 

TSAcTy 

Access Type 

Load= 25pF 

BOO 

1 

D 

a 

— 


ns 

TMRdi 

Memory Read Initiate 

Load= 25pF 

i 

27 

ma 

S:23 

— 

18 

— 

13.5 

ns 

TMRdt 

Memory Read Terminate 

Load= 25pF 

— 

27 

E! 

23 

— 

18 

— 

10 

ns 

Tsti 

Run Terminate 

Load= 25pF 

3 

17. | 

j§g3§f 

:|s 15 

IDI 

10 

2 

mm 

ns 

TRun 

Run Initiate 

Load= 25pF 

— 7 — 

6 

— 4 

— 

3 

ns 

TSMWr 

Memory Write 

Load= 25pF 

3 

27 

IB 

23 

\mm\ 

18 

2 

9.5 

ns 

TSExc 

Exception Valid 

Load= 25pF 

EBB 

OBBI 

— 

mm 

ns 


Reset Initialization 


Trst 

Reset Pulse Width 

TrstPLL 

Reset timing, Phase-lock on< 4 ' 5 ) 

Trstcp 

Reset timing, Phase-lock off( 4 ' 5) 


I IBlB M IBI BBl iiliEBl 


Capacitive Load Deration 


CLD Load Derate^ 6 ) 



If- 

128 

— 

128 


L 2 J 


| 1 | 




ns/25pF 
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NOTES: 

1. All timings are referenced to 1.5V. 

2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters apply when the 79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 
for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cy cle (tw o cycles of a 2x clock). 

6. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 1 5%. 

7. Clock transition time < 2.5ns for 33.33MHz; clock transition time < 5ns for other speeds. 


5. 
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IDT79R3500A RISCore™ RISC CPU PROCESSOR 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS^ 1 - 2 - 3 )— 

COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V±5%) 


79R3500AE 


Symbol Paramel 


Clock 


TckHigh Input Clock High* 2 ' 


TckLow | Input Clock Low* 2 ' 


in 

C 

C 

C 


Run Operation 


TDEn Data Enable* 3 ' 


Tddis Data Disable* 3 ' 


TDVal Data Valid 


TwrDiy Write Delay 


Data Set-up 


Data Hold* 3 ' 


CpBusy Set-up 


CpBusy Hold 


Access Type (1:0) 


Access Type (2) 


Memory Write 


Exception 


Address Valid 


lnt(n) Set-up 


lnt(n) Hold 


Stall Operation 


Test Conditions 



Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


Load= 25pF 


TSAVal 

Address Valid 

Load= 25pF 

TSAcTy 

Access Type 

Load= 25pF 

TMRdi 

Memory Read Initiate 

Load= 25pF 

TMRdt 

Memory Read Terminate 

Load= 25pF 

Tsti 

Run Terminate 

Load= 25pF 

TRun 

Run Initiate 

Load= 25pF 

TSMWr 

Memory Write 

Load= 25pF 

TSExc 

Exception Valid 

Load= 25pF 


Reset Initialization 


Trst 

Reset Pulse Width 

TrstPLL 

Reset timing, Phase-lock on* 4 ’ 5 ' 

Trstcp 

Reset timing, Phase-lock off* 4 ’ 5 ' 

Capacitive Load Deration 

CLD 

Load Derate* 6 ' 




12.5 

ns 

9 

ns 

9 

ns 

9 

ns 

6 

ns 

3 

ns 

9 

ns 

6 

ns 



ns/25pF 
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NOTES: 

1. All timings are referenced to 1 ,5V. 

2. The dock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters apply when the 79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 
for the longer of 3000 clock cydes or 200 microseconds. 

5. Tcyc is one CPU clock cy de (tw o cycles of a 2x clock). 

6. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns. 
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MILITARY AND COMMERCIAL TEMPERATURE RANGES 
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Figure 16. Processor Reference Clock Timing 

* These signals are not actually output from the processor. 

They are drawn to provide a reference for other timing diagrams. 
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Figure 18. Memory Write Timing 
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Figure 19. Memory Read Timing 
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Figure 21. Interrupt Timing 
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Figure 22. Mode Vector Initialization 



NOTES: 

1 . Reset must be neg ated s ynchronously; however, it should be asserted asynchronously. Designs must not rely on the proper functioning of SysOut prior 

to the assertion o f R eset. 

2. If Phase-Lock On or R3000 Mode are asserted as mode select options, they should be asserted throughout the Reset period, to insure that the slowest 
coproc essor in the system has sufficient time to lock to the CPU clocks. 

3. Reset is actually sampled in both Phase 1 and Phase 2. To insure proper initialization, it must be negated relative to the end of Phase 1 . 
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IDT79R3051 FAMILY OF 

INTEGRATED 

RISControllers™ 


ADVANCE INFORMATION 
IDT 79R3051™, 79R3051E 
IDT 79R3052™, 79R3052E 


FEATURES: 

• Instruction set compatible with IDT79R3000A and 
IDT79R3001 MIPS RISC CPUs 

• High level of integration minimizes system cost, power 
consumption 

— 79R3000A/79R3001 Execution Engine 

— R3051 features 4kB of Instruction Cache 

— R3052 features 8kB of Instruction Cache 

— All devices feature 2kB of Data Cache 

— “E” Versions (Extended Architecture) feature full 
function Memory Management Unit, including 64- 
entry Translation Lookaside Buffer (TLB) 

— 4-deep write buffer eliminates memory write stalls 

— 4-deep read buffer supports burst refill 


R3051 FAMILY BLOCK DIAGRAM 


— On-chip DMA arbiter 

— Bus Interface Minimizes Processor Stalls 
Single clock input 

Direct interface to R3720/21/22 RISChipset 

35 MIPS, over 64,000 Dhrystones at 40 MHz 

Low cost 84-pin PLCC packaging 

Flexible bus interface allows simple, low cost designs 

20, 25, 33, and 40 MHz operation 

Complete software support 

— Optimizing compilers 

— Real-time operating systems 

— Monitors/debuggers 

— Floating Point Software 

— Page Description Languages 
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RISControlIer, R305x, R3051, R3052 are trademarks of Integrated Device Technology, Inc. 
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ADVANCE INFORMATION 


INTRODUCTION 

The IDT R3051 Family is a series of high-performance 32- 
bit microprocessors featuring a high level of integration, and 
targeted to high-performance but cost sensitive embedded 
processing applications. The R3051 family is designed to 
bring the high-performance inherent in the MIPS RISC ar- 
chitecture into low-cost, simplified, power sensitive applica- 
tions. 

Functional units were integrated onto the CPU core in order 
to reduce the total system cost, rather than to increase the 
inherent performance of the integer engine. Thus, the R3051 
family is able to offer 35 MIPS of integer performance at 40 
MHz without requiring external SRAM or caches. 

Further, the R3051 family brings dramatic power reduction 
to these embedded applications, allowing the use of low-cost 
packagingfordevicesupto25MHz. TheR3051 family allows 
customer applications to bring maximum performance at 
minimum cost. 

Figure 1 shows a block level representation of the functional 
units within the R3051 family. The R3051 family could be 
viewed as the embodiment of a discrete solution built around 
the I DT 79R3000A or 79R3001 . However, by integrating this 
functionality on a single chip, dramatic cost and power re- 
ductions are achieved. 

Currently, there are four members of the R3051 family 
family. All devices are pin and software compatible: the 
differences lie in the amount of instruction cache, and in the 
memory management capabilities of the processor: 

• The R3052”E" incorporates 8kB of Instruction Cache, and 
features a full function memory management unit (MMU) 
including a 64-entry fully-associative Translation 
Lookaside Buffer (TLB). This is the same memory 
management unit incorporated in the IDT 79R3000A and 
79R3001. 

• The R3052 also incorporates 8kB of Instruction Cache. 
However, the memory management unit is a much 
simpler subset of the capabilities of the enhanced ver- 
sions of the architecture, and in fact does not use a TLB. 

• The R305TE” incorporates 4kB of Instruction Cache. 
Additionally, this device features the same full function 
MMU (including TLB file) as the R3052"E”, and R3000A. 

• The R3051 incorporates 4kB of Instruction Cache, and 
uses the simpler memory management model of the 
R3052. 

An overview of the functional blocks incorporated in these 
devices follows. 

CPU Core 

The CPU core is a full 32-bit RISC integer execution 
engine, capable of sustaining close to single cycle execution 
rate. The CPU core contains a five stage pipeline, and 32 
orthogonal 32-bit registers. The R3051 family implementsthe 
MIPS-IISA. In fact, the execution engine of the R3051 family 
is the same as the execution engine of the R3000A (and 
R3001). Thus, the R3051 family is binary compatible with 
those CPU engines. 


The execution engine of the R3051 family uses a five-stage 
pipeline to achieve close to single cycle execution. A new 
instruction can be started in every clock cycle; the execution 
engine actually processes five instructions concurrently (in 
various pipeline stages). Figure 2 shows the concurrency 
achieved by the R3051 family pipeline. 
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Figure 2. R3051 Family 5-Stage Pipeline 

System Control Co-Processor 

The R3051 family also integrates on-chip the System 
Control Co-processor, CPO. CPO manages both the excep- 
tion handling capability of the R3051 family, as well as the 
virtual to physical mapping of the R3051 family. 

There are two versions of the R3051 family architecture: 
the Extended Architecture Versions (the R3051 Eand R3052E) 
contain a fully associative 64-entry TLB which maps 4kB 
virtual pages into the physical address space. The virtual to 
physical mapping thus includes kernel segments which are 
hard mapped to physical addresses, and kernel and user 
segments which are mapped on a page basis by the TLB into 
anywhere within the 4GB physical address space. In this TLB, 
8 page translationss can be “locked” by the kernel to insure 
deterministic response in real-time applications. These ver- 
sions thus use the same MMU structure as that found in the 
IDT 79R3000A and 79R3001. Figure 3 shows the virtual to 
physical address mapping found in the extended architecture 
versions of the processor family. 

The Extended Architecture devices allow the system de- 
signer to implement kernel software to dynamically manage 
User task utilization of memory resources, and also allow the 
Kernel to effectively “protect” certain resources from user 
tasks. These capabilities are important in a number of 
embedded applications, from process control (where resource 
protection may be extremely important) to X-Window display 
systems (where virtual memory management is extremely 
important). 
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Figure 3. Virtual to Physical Mapping of Extended Architecture Versions 


The base versions of the architecture (the R3051 and 
R3052) removethe TLB and institute afixed address mapping 
for the various segments of the virtual address space. The 
base processors support distinct kernel and user mode op- 
eration without requiring page management software, leading 
to a simpler software model. The memory mapping used by 
these devices is illustrated in figure 4. Note that the reserved 
address spaces shown are for compatibility with future family 
members. 


When using the base versions of the architecture, the 
system designer can implement a distinction between the 
user tasks and the kernel tasks, without having to execute 
page management software. This distinction can take the 
form of physical memory protection, accomplished by address 
decoding, or in other forms. In systems which do not wish to 
implement memory protection, and wish to have the kernel 
and user tasks operate out of a single unified memory space, 
upper address lines can be ignored by the address decoder, 
and thus ail references will be seen in the lowergigabyte of the 
physical address space. 
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Clock Generation Unit 

The R3051 family is driven from a single input clock. On- 
chip, the clock generator unit is responsible for managing the 
interaction of the CPU core, caches, and bus interface. The 
clock generator unit replaces the external delay line required 
in R3000A and R3001 based applications. 

Instruction Cache 

The current family includes two different instruction cache 
sizes: the R3051 family (the R3051 and R3051 E) feature 4kB 
of instruction cache, and the R3052 and R3052E each incor- 
porate 8kB of Instruction Cache. For all four devices, the 
instruction cache is organized as a line size of 16 bytes (four 
words). This relatively large cache achieves a hit rate well in 
excess of 95% in most applications, and substantially con- 
tributes to the performance inherent in the R3051 family. The 
cache is implemented as a direct mapped cache, and is 
capable of caching instructions from anywhere within the 4GB 
physical address space. The cache is implemented using 
physical addresses (rather than virtual addresses), and thus 
does not require flushing on context switch. 

Data Cache 

All four devices incorporate an on-chip data cache of 2kB, 
organized as a line size of 4 bytes (one word). This relatively 
large data cache achieves hit rates well in excess of 90% in 
most applications, and contributes substantially to the perfor- 
mance inherent in the R3051 family. As with the instruction 
cache, the data cache is implemented as a direct mapped 


physical address cache. The cache is capable of mapping any 
word within the 4GB physical address space. 

The data cache is implemented as a write through cache, 
to insure that main memory is always consistent with the 
internal cache. In order to minimize processor stalls due to 
data write operations, the bus interface unit incorporates a 4- 
deep write buffer which captures address and data at the 
processor execution rate, allowing it to be retired to main 
memory at a much slower rate without impacting system 
performance. 

Bus Interface Unit 

The R3051 family uses its large internal caches to provide 
the majority of the bandwidth requirements of the execution 
engine, and thus can utilize a simple bus interface connected 
to slow memory devices. 

The R3051 family bus interface utilizes a 32-bit address 
and data bus multiplexed onto a single set of pins. The bus 
interface unit also provides an ALE signal to de-multiplex the 
A/D bus, and simple handshake signals to process processor 
read and write requests, in addition to the read and write 
interface, the R3051 family incorporates a DMA arbiter, to 
allow an external master to control the external bus. 

The R3051 family incorporates a 4-deep write buffer to 
decouple the speed of the execution engine from the speed of 
the memory system. The write buffers capture and FIFO 
processor address and data information in store operations, 
and presents it to the bus interface as write transactions at the 
rate the memory system can accommodate. Figure 5 illustrates 
a basic write transaction for the R3051/52. 
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Figure 5. IDT R3051 Family Write Operation (Two Bus Wait Cycles) 
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The R3051/52 read interface performs both single word 
reads and quad word reads. Single word reads work with a 
simple handshake, and quad word reads can either utilize the 
simple handshake (in lower performance, simple systems) or 
utilize a tighter timing mode when the memory system can 
burst data at the processor clock rate. Thus, the system 
designer can choose to utilize page or nibble mode DRAMs 
(and possibly use interleaving), if desired, in high-performance 
systems, or use simpler techniques to reduce complexity. 
Figure 6 illustrates a basic single word read; figure 7 illustrates 
a burst block transfer. More aggressive designs could signifi- 
cantly reduce the numberof processor stall cycles from those 
shown here. 


In order to accommodate slower quad word reads, the 
R3051 family incorporates a 4-deep read buffer FIFO, so that 
the external interface can queue up data within the processor 
before releasing it to perform a burst fill of the internal caches. 
Figure 8 shows the action of the processor for a “throttled” 
quad word read. Depending on the cost vs. performance 
tradeoffs appropriate to a given application, the system design 
engineer could include true burst support from the DRAM to 
provide for high-performance cache miss processing, or uti- 
lize the read buffer to process quad word reads from slower 
memory systems. 
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Figure 6. IDT R3051 Family Single Word Read Operation (Two Bus Wait Cycles) 
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Figure 7. IDT R3051 Family Burst Read Operation (Two Bus Wait Cycles) 
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Figure 8. IDT R3051 Family Throttled Quad Read Operation (Three Bus Walt Cycles, One Bus Wait Cycle Between Words) 
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SYSTEM USAGE 

The IDT R3051 family has been specifically designed to 
easily connect to low-cost memory systems. Typical low-cost 
memory systems utilize slow EPROMs, DRAMs, and appli- 
cation specific peripherals. These systems may also typically 
contain large, slow static RAMs, although the IDT R3051 
family has been designed to not specifically require the use of 
external SRAMs. 

Figure 9 shows a typical system block diagram. Transpar- 
ent latches are used to de-multiplex the R3051/52 address 
and data busses from the A/D bus. The data paths between 


the memory system elements and the R3051/52 A/D bus is 
managed by simple octal devices. A small set of simple PALs 
can be used to control the various data path elements, and to 
control the handshake between the memory devices and the 
R3051/52. 

Alternately, the memory interface can be constructed using 
the IDT R3051 family RISChipset, which includes DRAM 
control, data path control for interleaved memories, and other 
general memory and system interface control functions. These 
devices are described in separate data sheets. Figure 10 
illustrates a simple system constructed using the R3051 
family support chip set. 
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Figure 9. Typical R3051 Family Based System 
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Figure 10. R3051 Family Chip Set Based System 
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DEVELOPMENT SUPPORT 

The IDT R3051 family is supported by a rich set of devel- 
opment tools, ranging from system simulation tools through 
prom monitor support, logic analysis tools, and sub-system 
modules. 

Figure 11 is an overview of the system development 
process typically used when developing R3051 family-based 
applications. TheR3051 family is supported by powerful tools 
through all phases of project development. These tools allow 
timely, parallel development of hardware and software for 
R3051/52 based applications, and include tools such as: 

• A program, Cache-3051 , which allows the performance of 
an R3051 family based system to be modeled and 
understood without requiring actual hardware. 

• Sable, an instruction set simulator. 

• Optimizing compilers from MIPS, the acknowledged 
leader in optimizing compiler technology. 


IDT Cross development tools, available in a variety of 
development environments. 

The high-performance IDT floating point library software. 
The IDT Evaluation Board, which includes RAM, 
EPROM, I/O, and the IDT Prom Monitor. 

The IDT Laser Printer System board, which directly 
drives a low-cost print engine, and runs Microsoft 
Truelmage™ Page Description Language on top of 
PeerlessPage™ Advanced Printer Controller BIOS. 
Adobe PostScript™ Page Description Language, ported 
to the R3000 instruction set, runs on the IDT R3051 
family. 

The IDT Prom Monitor, which implements a full prom 
monitor (diagnostics, remote debug support, peek/poke, 
etc.). 
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Figure 11. R3051 Family Development Toolchain 
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PERFORMANCE OVERVIEW 

The R3051 family achievesaveryhigh-levelof performance. 

This performance is based on: 

■ An efficient execution engine. The CPU performs ALU 
operations and store operations at a single cycle rate, 
and has an effective load time of 1 .3 cycles, and branch 
execution rate of 1 .5 cycles (based on the ability of the 
compilers to avoid software interlocks). Thus, the 
execution engine achieves over 35 MIPS performance 
when operating out of cache. 

• Large on-chlp caches. The R3051 family contains 
caches which are substantially larger than those on the 
majority of today’s embedded microprocessors. These 
large caches minimize the number of bus transactions 
required, and allow the R3051 family to achieve actual 
sustained performance very close to its peak execution 
rate. 

• Autonomous multiply and divide operations. The 

R3051 family features an on-chip integer multiplier/divide 


unit which is separate from the other ALU. This allows 
the R3051 family to perform multiply or divide operations 
in parallel with other integer operations, using a single 
multiply or divide instruction rather than “step” operations. 

• integrated write buffer. The R3051 family features a 
four deep write buffer, which captures store target 
addresses and data at the processor execution rate and 
retires it to main memory at the slower main memory 
access rate. Use of on-chip write buffers eliminates the 
need for the processor to stall when performing store 
operations. 

• Burst read support. The R3051 family enables the 
system designer to utilize page mode or nibble mode 
RAMs when performing read operations to minimize the 
main memory read penalty and increase the effective 
cache hit rates. 

These techniques combine to allow the processorto achieve 
35 MIPS integer performance, and over 64,000 dhrystones at 
40 MHz without the use of external caches or zero wait-state 
memory devices. 
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PIN DESCRIPTION 


Pin Name 

I/O 

Description 

A/D(31 :0) 

I/O 

Address/Data: A 32-bit time multiplexed bus which indicates the desired address for a bus transaction 
in one cycle, and which is used to transmit data between this device and external memory resources on 
other cycles. 

Bus transactions on this bus are logically separated into two phases: during the first phase, information 
about the transfer is presented to the memory system to be captured using the ALE output. This 
information consists of: 

Address{31 :4): The high-order address for the transfer is presented. 

BE(3:0): , These strobes indicate which bytes of the 32-bit bus will be involved in 

the transfer. 

During write cycles, the bus contains the data to be stored and is driven from the internal write buffer. 
On read cycles, the bus receives the data from the external resource, in either a single word 
transaction or in a burst of four words, and places it into the on-chip read buffer. 

Addr(3:2) 

o 

Low Address (3:2) A 2-bit bus which indicates which word is currently expected by the processor. 
Specifically, this two bit bus presents either the address bits for the single word to be transferred (writes 
or single word reads) or functions as a two bit counter starting at ‘00’ for burst read operations. 

Diag(1) 

o 

Diagnostic Pin 1. This output indicates whether the current bus read transaction is due to an on- 
chip cache miss, and also presents part of the miss address. The value output on this pin is time 
multiplexed: 

Cached: During the phase in which the A/D bus presents address information, this 

pin is an active high output which indicates whether the current read is 
a result of a cache miss.The value of this pin at this time in other than 
read cycles is undefined. 

Miss Address (3): During the remainder of the read operation, this output presents 

address bit (3) of the address the processor was attempting to 
reference when the cache miss occurred. Regardless of whether a 
cache miss is being processed, this pin reports the transfer address 
during this time. 

Diag(O) 

o 

Diagnostic Pin 0. This output distinguishes cache misses due to instruction references from those 
due to data references, and presents the remaining bit of the miss address. The value output on this 
pin is also time multiplexed: 

l/D: If the "Cached” Pin indicates a cache miss, then a high on this pin at this 

time indicates an instruction reference, and a low indicates a data 
reference. If the read is not due to a cache miss but rather an uncached 
reference, then this pin is undefined during this phase. 

Miss Address (2): During the remainder of the read operation, this output presents 

address bit (2) of the address the processor was attempting to 
reference when the cache miss occurred. Regardless of whether a 
cache miss is being processed, this pin reports the transfer address 
during this time. 

ALE 

o 

Address Latch Enable: Used to indicate that the A/D bus contains valid address information for 
the bus transaction. This signal is used by external logic to capture the address for the transfer. 

DataEn 

o 

Data Input Enable: This signal indicates that the A/D bus is no longer being driven by the processor 
during read cycles, and thus the external memory system may enable the drivers of the memory 
system onto this bus without having a bus conflict occur. During write cycles, or when no bus 
transaction is occurring, this signal is negated. 
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PIN DESCRIPTION (Continued) 


Pin Name 

I/O 

Description 

Burst/ 

WrNear 

o 

Burst Transfer/Write Near: On read transactions, this signal indicates that the current bus read 
is requesting a block of four contiguous words from memory. This signal is asserted only in read cycles 
due to cache misses; it is asserted for all 1-Cache miss read cycles, and for D-Cache miss read cycles 
if selected at device reset time. 

On write transactions, this output tells the external memory system that the bus interface unit is 
performing back-to-back write transactions to an address within the same 256 byte page as the prior 
write transaction. This signal is useful in memory systems which employ page mode or static column 
DRAMs. 

Rd 

o 

Read: An output which indicates that the current bus transaction is a read. 

Wr 

o 

Write: An output which indicates that the current bus transaction is a write. 

Ack 

■ 

Acknowledge: An input which indicates to the device that the memory system has sufficiently 
processed the bus transaction, and that the CPU may either advance to the next write buffer entry or 
process the read data. 

RdCEn 

. 

■ 

Read Buffer Clock Enable: An input which indicates to the device that the memory system has 
placed valid data on the A/D bus, and that the processor may move the data into the on-chip Read 
Buffer. 

SysClk 

o 

System Reference Clock: An output from the CPU which reflects the timing of the internal 
processor sys clock. This clock is used to control state transitions in the read buffer, write buffer, 
memory controller, and bus interface unit. 

BusReq 

■ 

DMA Arbiter Bus Request: An input to the device which requests that the CPU tri-state its bus 
interface signals so that they may be driven by an external master. 

BusGnt 

o 

DMA Arbiter Bus Grant. An output from the CPU used to acknowledge that a BusReq has been 
detected, and that the bus is relinquished to the external master. 

SBrCond(3:2) 

BrCond(1:0) 

1 

Branch Condition Port: These external signals are internally connected to the CPU signals 
CpCond(3:0). These signals can be used by the branch on co-processor condition instructions as input 
ports. There are two types of Branch Condition inputs: the SBrCond inputs have special internal 
logic to synchronize the inputs, and thus may be driven by asynchronous agents. The direct Branch 
Condition inputs must be driven synchronously. 

BErr 

■ 

Bus Error: Input to the bus interface unit to terminate a bus transaction due to an external bus error. 
This signal is only sampled during read and write operations. If the bus transaction is a read operation, 
then the CPU will take a bus error exception. 

Irvt(5:3) 

Slnt(2:0) 

1 

Processor Interrupt: During operation, these signals are logically the same as the lnt(5:0) signals 
of the R3000. During processor reset, these signals perform mode initialization of the CPU, but in a 
different (simpler) fashion than the interrupt signals of the R3000. 

There are two types of interrupt inputs: the Sint inputs are internally synchronized by the processor, 
and may be driven by an asynchronous external agent. The other interrupt inputs are not internally 
synchronized. The direct interrupt inputs have one cycle lower latency than the synchronized 
interrupts. 

Clk2xln 


Master clock Input: This is a double frequency input used to control the timing of the CPU. 

Internally, the clock generator unit derives the four processor "2xclk” signals from this clock. 

Reset 

■ 

Master Processor Reset: This signal initializes the CPU. Mode selection is performed during 
the last cycle of reset. 

Rsvd(4:0) 

I/O 

Reserved: These five signal pins are reserved for testing and for future revisions of this device. 

Users must not connect these pins. 
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ABSOLUTE MAXIMUM RATINGS 0 ’ 3) 


Symbol 

Rating 

Commercial 

■HI 


Vterm 

Terminal Voltage 
with Respect 
to GND 

-0.5 to +7.0 

-0.5 to +7.0 

V 

Ta.Tc 

Operating 

Temperature 

0 to +70 
(Ambient) 

-55 to +125 
(Case) 

i 

Tbias 

Temperature 

Under Bias 

-55 to +125 

-65 to +135 


Tstg 

Storage 

Temperature 

-55 to +125 

—65 to +155 

°c 

Vin 

Input Voltage 

-0.5 to +7.0 

-0.5 to +7.0 

V 


NOTE: 

1 . Stresses greater than those listed under ABSOLUTE MAXI MUM RATI NGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. Vin minimum = -3.0V for pulse width less than 15ns. 

Vin should not exceed Vcc +0.5 Volts. 

3. Not more than one output should be shorted at a time. Durationoftheshort 
should not exceed 30 seconds. 

AC TEST CONDITIONS 


RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


Grade 

Temperature 

GND 

Vcc 

Military 

-55 6 C to +125°C 
(Case) 

OV 

5.0 ±10% 

Commercial 

0°C to +70°C 
(Ambient) 

0V 

5.0 ±5% 


OUTPUT LOADING FOR AC TESTING 



Symbol 

Parameter 

Min. 

Max. 

Unit 

VlH 

Input HIGH Voltage 

3.0 

— 

V 

VlL 

Input LOW Voltage 

— 

0.4 

V 

VlHS 

Input HIGH Voltage 

3.5 

— . 

V 

VlLS 

Input LOW Voltage 

— 

0.4 

V 
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DC ELECTRICAL CHARACTERISTICS— 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ±5%) 


Symbol 

Parameter 

Test Conditions 

20MHz 

25MHz 

33.33MHz 

40MHz 

Unit 

Min. 


Min. 


Min. 


M 


VOH 

Output HIGH Voltage 

Vcc = Min., Ioh = -4mA 

3.5 

— 

3.5 

— 

3.5 

■1 

3.5 

— 

■a 

VOL 

Output LOW Voltage 

Vcc « Min., Iol = 4mA 

— 

EM 

— 

0.4 

— 


El 

|2| 

Kfl 

VIH 

Input HIGH Voltage^ 3 ) 


2.0 

— 


— 


— 

1 2.0 „ 


D 

Vil 

Input LOW Voltage! 1 ) 


— 

0.8 

— 

0.8 

— 

, m 

, "2- 

0.8 


Vihs 

Input HIGH Voltage! 2 ’ 3 ) 


3.0 

— 


— 

, ;: 3:Qi 

1 — 


— 


Vils 

Input LOW Voltage! 1 ' 2 ) 


— 

0.4 

— 

q.4! 

■■ 

0.4 

— 

EM 

D 

Cin 

Input Capacitance! 4 ) 


— 

10 

— < 

mm 

* — 

10 

— 

10 

m 

COUT 

Output Capacitance! 4 ) 


— 

10 



— 

10 

— 

10 

ca 

ICC 

Operating Current 

VCC - 5 V, TA = 70°C 

— 

350 ; 


400 

— 

500 

— 

600 


IlH 

Input HIGH Leakage 

Vih - VCC 


100 

— 

100 

— 

100 

— 

100 


IlL 

Input LOW Leakage 

Vil = GND 

-100 

— 

-100 

— 

-100 

— 

-100 

— 

pA 

loz 

Output Tri-state Leakage 

Voh - 2.4V, VOL = 0.5V 

-100 

100 

-100 

100 

-100 

100 

-100 

100 

IQ 


NOTES: 

1. Vil Min. = -3.0V for pulse width les s than 15ns. Vil should not fall below -0.5 Volts for larger periods. 

2. Vihs and Vils apply to Clk2xln and Reset. 

3. Vih should not be held above Vcc + 0.5 volts. 

4. Guaranteed by design. 
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AC ELECTRICAL CHARACTERISTICS (1 * 2< 3) — 

COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V ±5%) 


33.33MHz 


Symbol 

Signals 

Description 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. Unit 

ti 

BusReq, Ack, BusError, Sint, 
RJCEn, Tint, BrCond, SBrCond 

Set-up to SysClk rising 

6 

— 

5 

— 

4 

— 

m 

— ns 

tla 

A/D 

Set-up to SysClk falling 

6 

— 

5 

— 

4 

— = 


BD 

t2 

BusFteq, Ack, BusError, Sint, 
FtdCEn, Tnt, BrCond, SBrCond 

Hold from SysClk rising 

4 

- 

4 

- 

3 


3 

— ns 

t2a 

A/D 

Hold from SysClk falling 

2 

— 

2 

— 

1 

— 

IjiI 

— 

t3 

A/D, Addr, Diaa, ALE, WF 
Burst/Wrhlear, Rd, DataEn 

Tri-state from SysClk rising 

— 

10 

— 

10 

— 

10 

— 

■m 

t4 

A/D, Addr, Diag, ALE, WF 
Burst/WrNear, Rd, DataEn 

Driven from SysClk falling 

— 

10 

— 

10 

— 

10 

f — 

HU 

t5 

BusGnt 


— 

8 

— 

7 

— 



t6 

BusGnt 


— 

8 

— 

7 

— 

6 

— 


t7 

Wr, Fid, Burst/WrNear, A/D 

BSHai 

— 

5 

— 

5 

—If 

mm 

— 


t8 

ALE 

BUB MM 

— 

4 

— 

4 


HUBi 

— 


t9 

ALE 


— 

4 

— 

4 


Ills 

— ' 


tio 

A/D 


2 

— 

2 

— 

mm 

— 

mm 


til 

bataEn 


— 

o 

— 

o 


mm 

— 


t12 

DataEn 

Asserted from A/D tri-state< 4 > 

0 

— 

0 

— 

wm 

f ~ 

0 


t14 

A/D 


0 

— 

0 

— 

0 

— 

0 






All outputs 


Timing deration for loading 
over 25pf( 4 ' 5 * 


NOTES: 

1. All timings referenced to 1.5 Volts. 

2. All outputs tested with 25 pF loading. 

3. The AC values listed here reference timing diagrams contained in the R3051 Family Hardware User’s Manual. 

4. Guaranteed by design. 

5. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 
test condition; that is, the deration factor is applied for each 25 pF over the specified test load condition. 
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ADVANCE INFORMATION 


ORDERING INFORMATION 


XXXXX - XX X X 

IDT 

Device Type Speed Package Process/ 



Blank 

Commercial Temperature Range 

•B* 

Compliant to MIL-STD-883, Class B 

'M' 

Military Temperature Range Only 

•J 1 

84-Pin PLCC 

'QJ' 

84-Pin J-Bend Cerpack 

•20' 

20.0 MHz 

'25' 

25.0 MHz 

'33' 

33.33 MHz 

’40’ 

40.0 MHz 

79R3051 

4kB Instruction Cache, No TLB 

79R3051E 

4kB Instruction Cache, With TLB 

79R3052 

8kB Instruction Cache, No TLB 

79R3052E 

8kB Instruction Cache, With TLB 
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Integrated Device Technology, Inc. 


THIRD GENERATION 
MIPS RISC PROCESSOR 


ADVANCE 

INFORMATION 

IDT79R4000 


FEATURES: 

• High-Performance, Highly Integrated CPU 

• Fully Binary Compatible with R2000, R3000 CPUs 

• Capable of over 50 VAX MIPS sustained system 
performance 

• High-level of performance 

— Utilizes super-pipelining to exploit 2-level instruc- 
tion level parallelism with no issue restrictions 

— Balanced integer and floating point performance 

— 64-bit floating point extensions 

— Multi-processing support 


• High-Level of Integration 

— RISC Integer Unit 

— IEEE Compatible Floating Point Units 

— Memory Management Unit 

— 8kB Instruction Cache 

— 8kB Data Cache 

— Direct control of optional secondary cache 

— Extensive multi-processing support 


BLOCK DIAGRAM 


Memory and 
Secondary Cache 
Interface 


Super-pipelined 
Control Unit 


Register File 
32 x 32-bits 


Floating Point 
Execution Units 


Floating Point Memory Management 

Units and 

(CPI) Exception Handling 

CEMOS Is atrademarKot Integrated Device Technology, Inc. 


Register File 
32 x 32-bits 


Integer Multiply/ 
Divide 


ALU 


Integer 

Unit 


Exception 

Handling 

Registers 


Memory 

Management 

Registers 


Translation 

Lookaside 

Buffer 


8k Bytes 
Data 
Cache 


8k Bytes 
Instruction 
Cache 


NOVEMBER 1990 


©1990 Integrated Device Technology, Inc. 
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ADVANCE INFORMATION 


DESCRIPTION: 


Exploiting Instruction Level Parallelism 


The R4000 is the third generation of MIPS RISC technol- 
ogy, continuing MIPS track record as the performance leader 
and establishing a new performance standard for the 1990’s. 

The R4000 maintains full binary compatibility with applica- 
tions executing on the R2000 and R3000 MIPS RISC CPUs 
(also available from IDT) and IDT’s RISControllei™ family, 
while achieving substantially higher performance. The key to 
this performance standard is both the architecture/implemen- 
tation of the processor, and the level of integration achieved 
in a single chip. The R4000 contains both a high-performance 
execution core (integer and floating point) as well as sufficient 
memory bandwidth (large on-chip caches) to keep the execu- 
tion engine running. The on-chip resources are comple- 
mented by a direct interface to an optional secondary cache, 
and also by multi-processing support, allowing the system 
designerto further increase memory bandwidth to the proces- 
sor, and to operate numerous processors togetherto increase 
overall computational power. This balanced architectural 
approach allows R4000 based systems to achieve a wide 
range of price performance goals. 

Keys to Performance 

All microprocessors are governed by the same basic per- 
formance equation: the time required to perform a given task 
is the product of the number of instructions required to execute 
the task with the <average> time required to complete an 
instruction. MIPS optimizing compilers, and the MIPS RISC 
architecture, serve to minimize the first term in the product. 
The R4000 maintains the same focus on compiler technology 
as an extension of the CPU architecture as did the earlier 
generations of MIPS processors. 

The remaining term in the performance equation is the 
average amount of time required to execute instructions. The 
R4000 is designed to exploit 2-level instruction parallelism, 
thus being able to retire 2 instructions per clock cycle (sus- 
tained). Further, the architecture, and the level of integration, 
allow substantially faster clock rates to be used. The combi- 
nation of fast clock rates, and multiple instructions per clock 
cycle, minimize the average time per instruction. 


There are a number of techniques available to achieve 
multiple instructions per clock cycle. The two discussed most 
frequently are super-scalar, and super-pipelined architec- 
tures. These machines attempt to initiate multiple instructions 
per clock cycle, as long as there are no data dependencies 
between the instructions. Thus, itisuptotheCPU(ratherthan 
the compilers or programmer) to detect and exploit this 
"instruction level parallelism" to increase performance. 

Superscalar machines attempt to run multiple instructions 
in distinct pipelines. In order to accomplish this, execution 
resources must be replicated in each pipeline. Further, 
significant logic must exist between the pipelines to insure that 
data dependencies amongst multiple instructions are re- 
solved properly, and to insure that exceptions are detected 
and handled precisely. Figure 2 illustrates a theoretical super- 
scalar machine of degree 2. 

Superpipelined machines attempt to initiate multiple in- 
structions per clock cycle, sequentially. In order to do this, 
execution units in the basic machine pipeline which have long 
latencies (require a long time to complete their operation) 
must be pipelined, so that multiple instructions can be executing 
simultaneously (although sequentially) in those units. In order 
to achieve high performance, the speed of the individual 
pipestages is higher than in equivalent superscalar imple- 
mentation. However, each pipestage is significantly less 
complex than the super-scalar equivalent, allowing these 
higher speeds to be achieved. Figure 3 shows the equivalent 
super-pipelined machine of degree 2 (not intended to repre- 
sent R4000 pipeline). 

Studies have shown that architecturally, superscalar and 
superpipelined are duals of each other: that is, each approach 
is roughly equivalent in its ability to exploit instruction level 
parallelism. Differences in performance will then be related to 
the actual implementation of those techniques in a given 
processor, constrained by the current semiconductor tech- 
nology. Compromises and trade-offs which may reduce 
effectiveness from the theoretical machine include: 
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Figure 2. Superscalar RISC Pipeline 
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Figure 3. Superpipelined RISC Pipeline 


• Issue Restrictions: Due to the complexity and implemen- 
tation cost of replicating execution units, and checking for 
data dependencies between parallel pipelines, many su- 
per-scalar machines institute restrictions in the types of 
operations that may be initiated in parallel. This degrades 
the chip from the theoretical performance of a true super- 
scalar machine. 

• Clock Frequency: the complexity of an implementation 
affects the clock frequencies achievable. A machine de- 
signed for instruction level parallelism may become so 
complex that the clock frequency is adversely affected. This 
can be a greaterfactor in multi-chip implementations, where 
significant speed is lost in bringing signals from one pack- 
aged part to another across a PC board. 

• Memory Bandwidth: a high-performance microprocessor 
needs substantial memory bandwidth to achieve its perfor- 
mance potential. If the implementation is too complex, there 
may not be enough room for adequate caches to keep the 
execution engine fed. A good implementation is able to 
integrate sufficient cache memory to allow the execution 
engine to frequently operate at its peak performance rating. 

• Compiler technology: the compiler technology must be 
capable of generating efficient code for the execution engine. 
In the case of a machine with issue restrictions, complex 
peephole optimizations may be required to maximize par- 
allel operation. These optimizations are generally outside 
the realm of other traditional optimizations required for more 
general (less restrictive) machines. 

• Level of parallelism exploited. A tradeoff can be made 
between machines with significant issue restrictions but 
high peak parallelism and machines with few or no issue 
restrictions but less peak parallelism. Numerous studies 
have shown that a machine capable of exploiting two level 
instruction parallelism, with no issue restrictions, will out- 
perform a machine capable of exploiting 4-level parallelism 
but which implements significant issue restrictions. Further, 
the amount of parallelism exploited by the machine may 
cause tradeoffs in other areas, such as amount of cache or 
clock frequency. 


Based on these constraints, the architects of the R4000 
have implemented a super-pipelined execution engine to 
exploit 2-level instruction parallelism, with no issue restric- 
tions, and with substantial primary instruction and data caches. 
The R4000 is thus capable of the raw execution speed 
required to achieve high-performance, and supplies sufficient 
bandwidth from its primary caches to minimize main memory 
cycles. Finally, the R4000 is able to benefit from the strength 
of the MIPS optimizing compilertechnology, without requiring 
an arbitrarily complex peephole scheduler. 

Level of Integration: 

The R4000 brings all of the execution resources necessary 
for a high-performance computing system into a single chip. 
These resources include: 

• A High Performance Execution Engine. The R4000 uti- 
lizes a super-pipelined execution engine, while maintaining 
full binary compatibility with the R2000/R3000. 

• Full featured MMU. The R4000 integrates memory man- 
agement and exception handling facilities on-chip as the 
system control co-processor (CPO), thus not requiring an 
external MMU device. 

• High Performance Floating Point Accelerator. The R4000 
integrates single and double precision floating point on- 
chip, as co-processor 1 . 

• Large primary caches: the R4000 integrates large (8k 
bytes each) instruction and data caches on-chip. These 
large caches allow the execution resources to operate at 
peak rates through substantial amounts of the application, 
resulting in high actual system performance, not just peak 
native MIPS. 

• Directsupportforoptionalsecondarycache. TheR4000 
incorporates the ability to implement an external secondary 
cache, to further increase processor bandwidth. This is 
especially important in multi-processing systems. 

• Multi-processing support. The R4000 provides the 
support necessary to implement high-performance, multi- 
processing systems. 
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APPLICATIONS 

The R4000 extends the performance range served by the 
MIPS architecture into higher levels of performance. The 
R4000 provides a high-performance migration path to those 
applications currently served by devices such as the R3000, 
R3001, and R3051. 

The MIPS RISC architecture has found widespread accep- 
tance in a number of applications. These include: 


• High-performance multi-processing systems. Further 
computational throughput can be achieved by implement- 
ing multiple R4000 in a single system, as illustrated in 
figure 4. MIPS RISC is already well represented in multi- 
processing applications, including systems from Silicon 
Graphics, Stardent Computer, and Digital Equipment 
Corporation. The R4000 allows these systems to imple- 
ment even higher performance in each CPU, increasing 
overall system capability. 



Figure 4. R4000-Based Multi-Processing System 


• Real-time systems. The Joint Integrated Avionics 
Working Group (JIAWG) committee has selected the 
MIPS RISC architecture as a standard for military avion- 
ics. The R4000 allows these real-time applications to 
benefit from high integration and CPU performance. 

• Embedded computing systems. The MIPS RISC 
architecture has won designs in a number of embedded 
systems applications, including laser printers, graphics 
systems, and data communications. A typical high- 
performance embedded system built around the R4000 is 
illustrated in figure 5. 



Figure 5. R4000-Based Embedded System 
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• Desktop workstations. MIPS RISC is a leading archi- 
tecture in UNIX™ based workstations, from vendors such 
as MIPS, Digital Equipment Corporation, and Silicon 
Graphics. The R4000 extends the performance range 
achievable in a desktop environment, while minimizing 
chip count (and thus real estate, cost, and power con- 
sumption) as illustrated in figure 6. 



(VME, TurboChannel, etc.) 


Figure 6. R4000-Based Desktop Workstation 


• Deskside server systems. The R4000 is also capable 
of supporting high performance server systems, such as 
systems built by MIPS and Digital Equipment (around the 
R3000) today. Implementing a secondary cache, and a 
larger I/O and main memory system, extends the basic 
UNIX system to a high-performance server system, as 
shown in figure 7. 



(VME, TurboChannel, etc.) 


Figure 7. R4000-Based Deskside Server System 


ADDITIONAL INFORMATION 

Additional information on the R4000 is available from IDT. 
Please contact your local sales representative for additional 
information on this product. 
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RISC SUPPORT COMPONENTS 


A RISC microprocessor is an important, but not self- 
sufficient, element of a high-performance general or embed- 
ded computing system. Equally important is the memory 
system (both cache and main memory) and the I/O interface 
to the execution core. 

To simplify the task of building these high-performance 
subsystems, IDT produces a wide variety of support chips 
and building block devices. These chips range from general 
purpose devices such as fast static RAM and high-perform- 
ance logic (used with many processor families) , to special- 
ized devices used in only certain types of applications (such 
as the IDT LaserFlFO, used in laser printer systems) and 
devices designed to work with only a specific processor 
family. 

Generic building block devices include SRAMs, with den- 
sities from 16KB to 1MB and access times as low as 7ns, as 
well as high-speed logic devices such as the FCT-T family. 

Devices specifically developed for RISC systems include 
the RISChipset™ — 3720 Bus Exchanger, 3721 DRAM 
Controller and the 3722 I/O Controller. These components 


facilitate design of systems based upon the R3051/52 con- 
troller family. The DRAM and I/O controllers have direct bus 
interface to the 3051/52. 

The R3020 Write Buffer enhances the performance of 
R3000 systems by allowing the processor to perform write 
operations at full clock speeds instead of resorting to time- 
consuming CPU stall cycles. The memory can then retire 
the data at a slower rate. The R32xx family of read/write 
buffers includes the memory read capability, enabling the 
use of slower main memory without impacting system per- 
formance. 

By providing these system solutions as building blocks, 
IDT allows its customers the maximum flexibility in achieving 
their price performance goals while minimizing time-to-mar- 
ket, real estate and complexity of the end system. 

This section of the data book contains some selected 
devices which have either been specifically designed for 
particular RISC processors or found to be exceptionally 
useful in these high-performance systems. 
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BUS EXCHANGER 
FOR R3051 FAMILY 


ADVANCE 

INFORMATION 

IDT79R3720 


FEATURES: 

• Direct Interface to R3051 Family RISChipSet™ 

— R3051™ Family of Integrated RISController™ 

CPUs 

— R3721 DRAM Controller 

— R3722 I/O Interface Controller 

• Interfaces a single CPU bus to interleaved or banked 
memory systems 

• Data path for read and write operations 

• Low noise outputs 

• Supports R3051 family systems from 20 to 33MHz 

• Simplifies data path design in high-performance memory 
systems 

• 3-Bus Architecture 

— One CPU Bus 

— Two (interleaved or banked) memory busses 

— Each bus independently latched to support 
asynchronous operation 

• 68-Pin PLCC Package 

• High-performance CEMOS™ technology 


DESCRIPTION: 

The R3720 Bus Exchanger is designed to provide data path 
support in an R3051 family system utilizing interleaved or 
banked memory techniques. The Bus Exchanger is respon- 
sible for interfacing between the CPU A/D bus (CPU address/ 
data bus) and multiple memory data busses. 

Thus, the Bus Exchanger uses a three bus architecture, 
with control signals suitable for simple transfer between the 
CPU bus and either memory bus. The Bus Exchanger 
features independent read and write latches for each memory 
bus, thus supporting a variety of memory strategies. 

The bus exchanger can be used as a simple transceiver, 
passing data between the single CPU bus and the pair of 
memory busses. Alternately, data from any of the three ports 
can be latched by the Bus Exchanger, to free the sending port 
(memory, during reads, and the CPU, during writes) while the 
receiving port processes the transfer at its own rate. 

The difference in operation is accomplished through the 
use of a simple set of control signals. These signals include 
independent latch enables for all three ports, signals to 
indicate the direction of transfer (read or write) and which 
memory port is involved, and other signals which can be used 
to force the ports to operate as either a transceiver or a true 
latch. 


BLOCK DIAGRAM 
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IDT79R3720 BUS EXCHANGER 


USE AS PART OF THE R3051 FAMILY CHIPSET 

When used with an R3051 family CPU, a pair of bus 
exchangers are typically used as illustrated in figure 2. 

The bus exchanger is typically used as an integrated 
transceiver in R3051 family applications; that is, the latches 
are held "open" through the transfer. In such an application, 
the single bus exchanger replaces 8 basic transceivers plus 
logic to coordinate data flow between the paths. 

The system memory model determines the mapping be- 
tween processor addresses and memory ports. In a system 


ADVANCE INFORMATION 


with two non-interleaved banks of memory, a high-order 
address bit from the processor determines which memory port 
is being accessed. In an interleaved memory system, Addr(2) 
is used to alternate between two banks of memory. Both these 
cases are handled by the R3721 DRAM controller for the 
R3051 family. 

The R3721 DRAM controller for the R3051 family uses the 
bus exchanger as a simple set of transceivers. The R3721 
directly controls the inputs of the bus exchanger, during both 
reads and writes. 



Figure 2. Bus Exchanger Used in R3051 Family System 
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DRAM CONTROLLER 
FOR R3051 FAMILY 


ADVANCE 

INFORMATION 

IDT79R3721 


FEATURES: 

• Direct Interface to R3051™ Family RISController™ Chip 

Set 

— R3051 Family Integrated RISController CPU 

— R3720 Bus Exchanger 

— R3722 I/O Interface Controller 

• Directly drives DRAM address and control signals 

— Directly drive 36 DRAM devices or multiple SIMM 
modules 

— High output capability requires no external drivers 
for DRAM address or control signals 

— Replicated control signals allows control of multiple 
memory banks 

— Directly drives Bus Exchanger or 74FCT245 data 
path buffers 

• Low noise outputs with built-in series resistance for 

direct drive of DRAMS (large capacitive load) 

• 20, 25, and 33 MHz Operation 

• Simplifies system design by eliminating glue logic/PALs 


• High performance from low cost DRAMs 

— Programmable timing model for 60-120 ns 
DRAMs 

— Supports Page Mode read and writes using on-chip 
Page Detection 

— Supports R3051 Family cache burst refill 

— DMA interface for burst DMA read or write 
accesses 

• Supports Multiple Common DRAM Configurations 

— 1MB to 16 MB 

— 256kx4 through 4Mx1 DRAMs 

— 1 , 2, or 4 Banks of DRAMs 

— Non-interleaved or Dual Interleaving Configurations 

— Page Mode, or Static Column Mode Accesses 

— CAS-before-RAS or RAS-only refresh timing 
control on-chip 

• Cascadeable to allow multiple DRAM controllers per 
system 

• Low noise TTL level outputs for CPU interface 

• High-performance CEMOS™ technology 
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DESCRIPTION: 

The R3721 DRAM controller provides a direct interface 
between the R3051 family CPU and DRAM-based main 
memory. The R3721 requires no PALs, buffers, or drivers to 
interface to the CPU or to the DRAM devices. It is capable of 
being used either with the R3722 I/O Controller or in applica- 
tions which perform I/O functions using discrete or ASIC- 
based approaches. The R3721 DRAM controller thus dra- 
matically simplifies the problem of interfacing DRAMs to the 
R3051 family. 

Operation Overview 

The R3721 accepts a memory decode input from an 
external address decoder (for example, the R3722) and 
performs the full data transfer requested. Using an external 
input to initiate transfers allows multiple R3721's to be cas- 
caded in large memory systems. 

The R3721 allows a number of different memory configu- 
rations to be used, and a number of different speeds of DR AM 
devices to be used, at any given CPU speed. The various 
options are selected by performing a write to the internal mode 
register of the R3721 . Options in the memory configuration 
include: use of 256kx4 through 4Mx1 DRAM organizations: 
single, banked, or interleaved memory configurations: DRAM 
speeds of 60 - 120ns; and other options relating to DRAM 
refresh, use of static column mode, etc. Design decisions on 
which model best serves the application will be based upon 
the cost, performance, and main memory size requirements of 
the system. 


The R3721 performs reads and writes, as requested by the 
R3051 (or other bus master, in the case of systems which 
include DMA). The R3721 is capable of performing the single 
or quad word reads requested by the processor, aswellasthe 
buffered write (either near or not near in the DRAM page). 
Burst reads can be processed as either true burst, or throttled 
reads, depending on the memory configuration and speed. 
Refer to the R3051 Family data sheet or Hardware User's 
Guide for more information on the types of transfers performed 
by an R3051-family CPU. 

An operation is initiated when the R3721 is selected ex- 
ternally for a transfer. The R3721 then monitors the input 
control signals of the R3051 interface bus, and performs the 
indicated transfer. The R3721 performs the entire handsh ake 
requ ired by the R3051 family, including the use of RdCEn and 
Ack to control the transfer of data between the CPU and the 
DRAM. In addition to performing the control interface between 
the R3051 bus interface and the DRAM (including address- 
ing), the R3721 also manages the data path between the 
DRAMs and the R3051 family CPU, by directly controlling 
either a 74FCT245-type transceiver device (for single or 
simple banked designs) or an R3720 Bus Exchanger (for 
interleaved or banked memory systems). 

APPLICATIONS 

The R3721 is designed to interface directly to an R3051 
family CPU. Additionally, the R3721 works seamlessly with 
the R3720 Bus Exchanger (in interleaved memory systems) 
and the R3722 Integrated I/O controller. Figure 2 illustrates 
the construction of an R3051 family systems using this chip 
set. 
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Figure 2. Use of R372x RlSChlpset in R3051 Family System 
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I/O INTERFACE CONTROLLER 
FOR R3051 FAMILY 


ADVANCE 

INFORMATION 

IDT79R3722 


FEATURES: 

• Direct Interface to R305x RlSControllei™ RISChipSet™ 

— R3051 Family of Integrated RISController CPUs 

— R3721 DRAM Controller 

— R3720 Bus Exchanger 

• Supports R3051 family systems from 20 to 40MHz 

• Priority Interrupt Controller for On-Chip Peripherals 

— Nine Internal Interrupt Requests 

— Interrupt Pending Register 

— Each interrupt independently maskable 

• Three timer/counters with programmable counter modes 

• Programmable parallel I/O port 


• Boot interface supports byte-wide or 32-bit boot prom 

• DualUART 

— Full Duplex support 

— Programmable data formats 

— Programmable baud rates to 19.2K baud 

• High-performance DMA channels 

• Interfaces between R305x and external I/O devices 

— Address space decoding into 8 I/O segments 

— Programmable wait state generation per I/O 
segment 

• 132-Pin PQFP Package 


BLOCK DIAGRAM 


Int 


A/D(31:0) 

Addr(3:2) 

ALE 

SysClk 

Rd 

Wr 


RdCEn 

Ack 

Burst/WrNear 

BusReq 

BusGnt 

Reset 



PromSel 

PROMOE 

ByteEnable(3:0) 


MemSel 

DRAMRdCEn 

DRAMAck 

CS(7:0) 

lORd 

lOWr 


PIO(23:0) 


XI /X2 

TxD(1:0) 

RxD(1:0) 

T/CCIk 
Gate(1 :0) 
Out(1 :0) 

DReq(3:0) 

DAck(3:0) 

EOd(3:0) 


CEMOS, RISController, R305x, R3051, R3052 are trademarks of Integrated Device Technology, Inc. 

NOVEMBER 1990 


©1990 Integrated Device Technology, Inc. 


6.3 


DSC4052 /- 

1 
















IDT79R3722 I/O INTERFACE CONTROLLER 


ADVANCE INFORMATION 


DESCRIPTION: 

The R3722 is designed to link an R3051 family CPU to the 
various memory and I/O resources of the target application. In 
addition to performing the control functions required in the 
system, the R3722 also incorporates many of the simple 
peripheral functions found in typical embedded systems: 
parallel I/O, serial I/O, and counter/timers. Finally, the R3722 
incorporates DMA channels, designed to move data between 
the various resources underthe joint control of the R3051 and 
the R3722. 

The R3722 is really designed to eliminate the PAL and glue 
logic found in many embedded systems. The R3722 provides 
the functions typically performed by PALs, such as: address 
decoding; wait state generation; synchronization; and pe- 
ripheral control. 

Note that the R3722 does not directly perform DRAM 
conttrol; rather, it decodes references to DRAM, and provides 
an output signal to an external DRAM controller (such as the 
R3721). This is an appropriate trade-off, allowing the embed- 
ded sytem designer maximal flexibility in interfacing to the 
memory type, size, and organization appropriate for that 
application. 

OPERATION OVERVIEW 

The R3721 constantly monitors the R3051 bus; when a 
transfer is indicated (by the assertion of the ALE signal from 
theCPU),theR3721 performs address decoding. Depending 
on the transfer requested (read orwrite, and to which address), 
a number of different events may occur: 

The R3722 may perform the transfer itself (between the 
CPU and one of its internal peripherals); 


• It may assert one of its "Chip Select" outputs, indicating 
that a peripheral under its control is requested for the 
transfer. In this case, the R3722 will provide the hand- 
shake back to the CPU, controlling the wait states 
required to perform the access. 

• It may decode the reference as a reference to the boot 
prom space. In this case, the R3722 will perform a 32-bit 
transfer, or a series of 8-bit transfers, from the boot prom, 
depending on how the system has configured the boot 
prom interface. 

• It may decode the reference as a DRAM reference, in 
which case it will assert its DRAMSel output, indicating 
that the external DRAM controller is required to perform 
the transfer. In this case, the DRAM controller is respon- 
sible for wait state generation, allowing the tightest 
possible coupling (and timing handshake) between 
DRAM and the CPU. 

In addition to handling data transfers, the R3722 also 
performs basic peripheral functions. For these functions, it 
uses logic blocks which are operation and software compat- 
ible with the 825x series of peripheral devices. 

Finally, the R3722 performs DMA transfers between vari- 
ous memory regions. The R3722 is capable of using the 
R3051 bus at the same transfer speeds as those supported by 
the processor itself, resulting in the highest levels of DMA 
performance. 


APPLICATION OVERVIEW 


The R3722 is designed to interface directly to an R3051 
family CPU. Additionally, the R3722 works seamlessly with 
the R3721 DRAM Controller and R3720 Bus Exchanger. 
Figure 2 illustrates the construction of an R3051 family sys- 
tems using this chip set. 
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Figure 2. R3051 and RISChipset In an Embedded System 
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RISC CPU WRITE BUFFER 


IDT79R3020 


FEATURES: 

• Temporary storage buffers to enhance the performance of the 
IDT79R3000 RISC CPU processor 

• Allows for write operations by the RISC CPU processor during 
Run cycles 

• Each Write Buffer has four locations to handle an 8-bit 
address slice and a 9-bit data slice (including a parity bit) 

• High-speed CEMOS™ technology 

• Pin, functionally and software compatible with the MIPS 
Computer Systems R2020 Write Buffer 

• Speeds from 16.7 to 40 MHz 

• Military product compliant to MIL-STD-883, Class B 


DESCRIPTION: 

The IDT79R3020 Write Buffer enhances the performance of 
IDT79R3000 systems by allowing the processor to perform write 
operations during Run cycles instead of resorting to time- 
consuming stall cycles. Each IDT79R3020 device handles an 8-bit 
slice of address, and a 9-bit slice of data (one parity bit per byte); 
thus, four IDT79R3020s provide 4-deep buffering of 32 bits of 
address and 36 bits of data and parity. Figure 1 illustrates the 
functional position of the Write Buffer in an 1DT79R3000 system. 

Whenever the processor performs a write operation, the Write 
Buffer captures the output data and its address (including the 
access type bits). The Write Buffer can hold up to four data- 
address sets while it waits to pass the data on to main memory. 
Transfers from the processor to the write buffers occur synchro- 
nously at the cycle rate of the processor and the write buffer signals 
the processor if it is unable to accept data. The write buffer also 
provides a set of handshake signals to communicate with a main 
memory controller and coordinate the transfer of write data to main 
memory. 

The sections that follow describe these IDT79R3020 Write 
Buffer interfaces: 

• the processor-Write Buffer interface 

• the Write Buffer-main memory interface 

• a miscellaneous, Write Buffer-board control interface. 
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Figure 1. The IDT79R3020 Write Buffer in an IDT79R3000 System 
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WRITE BUFFER - IDT79R3000 PROCESSOR 
INTERFACE 

Figure 2 shows the signals comprising the Write Buffer interface 
to the IDT79R3000 (all descriptions assume that four IDT79R3020 
Write Buffers are used to implement a 32-bit, buffered interface). 
The AdrLo bus and Tag bus bits from the processor are both 


connected to the Write Buffer to form a 32-bit physical address that 
is captured by the buffers. Thirty-two bits of data, four bits of parity, 
and two access type bits are also captured by the Write Buffer. The 
paragraphs that follow describe the Write Buffer-processor inter- 
face signals and the timing of processor-to-Write Buffer data 
transfers. 



Figure 2. Write Buffer — IDT79R3000 Processor Interface 


Write Buffer-Processor Interface Signals 

Clock 

An inverted version of the IDT79R3000’s SysOut signal from the 
IDT79R3000 processor that synchronizes data transfers. The 
Write Buffer uses the trailing edge of Clocklo latch the contents of 
the AdrLo bus and uses the leading Clock edge to latch the con- 
tents of the Data and Tag buses. 

Dataln8:0 

Nine input data lines from the IDT79R3000 processor’s Data bus 
(eight bits of data and one bit of parity). 

Addrln7:0 

Eight input address lines from the IDT79R3000 processor. The 
address lines are taken from the AdrLo and Tag buses. 

Addressl :0 

The two least significant address bits from the IDT79R3000 proc- 
essor. These two address bits must be connected to all four Write 
Buffers and are used in conjunction with the access type 
( AccTyp1:0 ) signals, the Position 1:0 signals, and the BigEndian 
signal to determine which byte(s) in a word are being written into a 
particular Write Buffer. 

AccTypln1:0 

The access type signals from the IDT79R3000 processor speci- 
fying the size of a data access: word, tri-byte, half-word, or byte. 

WtMem 

This input is connected to the MemWr signal from the 
IDT79R3000 processor that is asserted whenever the processor is 
performing a store (write) operation. 

Request 

The primary purpose of this signal is to request access to mem- 
ory and is described later when th e Write Buffer-Main Memory 
Interface is discussed. The Request signal can also be connected 
to the CpCondO input of the IDT79R3000 and can then be tested 
by software to determine if there is any data in the Write Buffer. 


Since Request is deasserted if there is no data in the Write Buffer, 
software can determine if a previous write operation (for example, 
to an I/O device) has been completed before initiating a read or 
read status operation from that device. 

WbFull 

The W rite Buffer asserts this signal to the IDT79R3000’s 
WrBusy input whenever it cannot accept any more data; that is, 
when the current write will fill the buffer or the buffer has all ad- 
dress-data pairs occupied. The IDT79R3000 pro cessor performs 
a write-busy stall if it needs to store data while the WbFull/WrBusy 
signal is asserted. 

Data & Address Connections 

Figure 3 illustrates how four Write Buffers are connected to the 
address and data outputs of the IDT79R3000 processor. 
Address Inputs 

Each Write Buffer device has eight address inputs (Adrln7:0). 
The four low-order bits (Adrln3:0) are clocked into the device on 
the trailing edge of the Clock signal and are taken from the 
IDT79R3000's AdrLo bus. The four high-order bits (Adrln7:4) are 
clocked into the device on the rising edge of the Clock signal and 
are taken from the IDT79R3000’s Tag bus. 

Each device also has separate inputs (Addressl, AddressO) for 
the two low-order bits from the AdrLo bus. These bits must be 
input to each device since they comprise the byte pointer. Note in 
Figure 3 that the two low-order Adrln inputs (Adrlnl :0) to Write 
Buffer device 0 are connected to ground since the Addressl, 
AddressO inputs already supply these bits to the device. 

Data Inputs 

Each Write Buffer device has nine data inputs that are clocked 
into the device on the leading edge of the Clock signal and are 
taken from the IDT79R3000’s Data bus. In Figure 3, each device 
captures eight bits of data and one bit of parity. Also note that the 
data bits assigned to each device correspond to the address bits 
connected to the device. This arrangement is required since data 
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selection is dependent on a combination of the AccType signals simplifies system utilization of the "Read Error Address” feature 

and the two low order address bits. The arrangement also described later. 




Figure 3. Write Buffer Data and Address Line Connections 


The Positionl and PositionO signals shown in Figure 3 specify 
the nibble position within a halfword that each write buffer device 
comprises. 

Write Buffer - Processor Timing 

Transfers between the processor and the Write Buffers occur 
synchronously: the Clock signal from the processor is input to the 
Write Buffers and used to clock the address and data information 


into the Write Buffers’ latches. Figure 4 illustrates the timing for the 
processor- Write Buf fer interface. 

When the WrtMem signal is asserted, the low-order address bits, 
and the Address 1 :0 inputs, are latched on the trailing edge of the 
Clock signal (1). The rising edge of Clock (2) is used to latch the 
high-order address bits, the access type inputs and the contents of 
the data bus. 
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WRITE BUFFER - MAIN MEMORY INTERFACE 

Figure 5 shows the signals comprising the Write Buffer interface 
to main memory. This interface is essentially decoupled from the 
Write Buffer-processor interface: although some synchronization 


of the memory interface signals and the Clock signal is required, 
the handshaking signals in this interface have no direct connection 
to the operation of the Write Buffer-processor interface. 
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Figure 5. Write Buffer — Main Memory Interface 


Write Buffer - Main Memory Interface Signals 

Each Write Buffer provides the following signals that comprise 
the interface to a main memory controller: 

AddrOut7:0 

Eight address line output from each Write Buffer. 

PataOut 8:0 

Nine data lines from each Write Buffer (eight bits of data and one 
bit of parity). 

AccTypOut 1 :0 

The access type signals from the Write Buffer specifying the size 
of a data access: word, tri-byte, half-word, or byte. 


OutEn 

The memory controller asserts this write input to enable the tri- 
state outputs of the IDT79R3020 address and data signals. 

Request 

The Write Buffer asserts this signal to inform the main memory 
system that it has data to be written to memory. 

Acknowledge 

The main memory system asserts this signal when it has cap- 
tured the data presented by the Write Buffer on the DataOut lines. 
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Write Buffer • Main Memory Interface Timing 

Figure 6 illustrates the timing for the transfer of data from the 
Write Bufferto the main memory system. The sequence illustrated 
in this figure is as follows: 

1 ) When the Write Buffer has a data- address p air for transfer to 
the memory system, it asserts the Request signal. 

2) When memory sy stem is ready to handle the Write Buffer 
data, it asserts the OutEn signal to enable the Write Buffers' 
address and data outputs onto the system buses. 

3) When memory system no longer requires the Write Buffer 
address and data outputs, it asserts the Acknowledge signal. 


The Write Buffer responds to this signal by discarding the 
address-data pair that was just output. 

4) The memory system can deassert the OutEn signal to return 
the Write Buffers' address and data outputs to their tri-state 
condition. 

5) Since the Reques t signal remains asserted, the memory sys- 
tem asserts the OutEn signal again to enable the next 
address-data pair onto the system buses. 

6) When memory system has accepted the second address- 
data pair, it again asserts the Acknowledge signal. If the Write 
Buf fer is now empty, it responds to this signal by deasserting 
the Request signal. 



Note that the buffer’s interfac e to main memory is not completely 
asynchronous: assertion of the Request signal by the Write Buffer 
is synchronized with the rising edge of Clock, and the Acknowl- 
edge signal input by main memory has a minimum set up and hold 
time in relation to the Clock signal. 

MISCELLANEOUS WRITE BUFFER - BOARD 
LOGIC INTERFACE 

The Write Buffers support several functions that utilize signals 
that do not fit neatly into the descriptions of either the processor or 
main memory interfaces. These functions and signals typically 
involve miscellaneous logic on a CPU board and include the fol- 
lowing: 

• byte gathering 

• configuration connections (Big Endian, Position 1:0) 

• address matching logic 

• error address latch logic 

The sections that follow describe each of these categories. 

Byte Gathering 

The Write Buffers perform byte (half-word, tri-byte and word) 
gathering to decrease the number of write transfers to same loca- 
tion; that is, sequential writes to the same WORD address have 
their data combined into the same address-data pair buffer. 

Byte gathering is prohibited in the address-data pair that is cur- 
rently available to the memory controller. Thus, the first write into 
an empty Write Buffer will not have subsequent writes gathered 
into it because it is currently available for output to memory. Writes 
to the same location (byte) may be overwritten in the Write Buffer if 
the gathering is not prohibited by the preceding rule. 

The Write Buffers present address-data pairs to the main mem- 
ory controller in the sequence in which they were received from the 
processor except in the case of gathered data, where bytes or half 
words can be collected and written to main memory in a single 
write operation. If the address-data pair buffer is scheduled to be 
output, then gathering is inhibited and the buffer contents are pre- 


sented to the main memory controller. Subsequent writes are then 
placed in another buffer. No reliance should be placed in any 
aspect of gathering (except that it only involves sequential writes to 
the same word address) as it is not readily deterministic. Non- 
sequential writes to the same word address are not gathered. 

Note that gathering can require that two main memory controller 
references be used to empty a single Write Buffer entry. For exam- 
ple, this can occur if Bytes 0 and 3 of a word are sequentially writ- 
ten. Where order in writing is important, such as in I/O controllers, 
software should avoid sequential accesses to the same word. In 
cases where write-read access ordering is important but reading of 
the write location is not desired, such as during I/O, then a write fol- 
lowed by a write to a dummy location followed by a read of the 
dummy location will insur e the first write has occurred before con- 
tinuing. Alternatively, the Request signal can be tested to deter- 
mine that the Write Buffer is empty. 



Configuration Logic Connections 


Because of their byte gathering capability, each buffer device 
internally maintains a record of each valid byte in an address/data 
pair. To do this, each device must have a way of determining which 
data bits within a word it is handling. The following signals deter- 
mine how the write buffers handle data that is written to the 
devices: 


• Position 1 , Position 0 - these signals (in conjunction with 
BigEndian) determine how each Write Buffer decodes the 
Address 1/0 and AccType 1/0 to determine if it should store 
the data inputs. Refer to Figure 3 for an illustration of how 
data bits are assigned to Write Buffer devices based on their 
position. 

• BigEndian - When asserted, byte 0 is the leftmost, most 
significant byte (big-endian): when deasserted, byte 0 is the 
rightmost, least-significant byte (little-endian). 

• Address 1, Address 0 - these signals (taken from the AdrLo 
bus) must be connected to all buffer devices since they 
determine which byte within a word is being accessed. 
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• AccType 1, AccType 0 - these inputs signals specify the data Table 1 shows how these signals operate to specify how bytes 
size of a write operation as shown in Table 1 . are saved within the Write Buffers. 



Address 

1 0 

Bytes Accessed 

m Big-Endian n 

si Little-Endian n 

i i 

(word) 

0 0 



Sippi mm 




1 0 
(triple-byte) 

0 0 

0 1 

— — — 

■HBHHIlll 



0 1 
(halfword) 

m 



■■■■HlillH! 

HBHHIHil 

0 0 

(byte) 

0 0 

0 1 

1 0 

1 1 






mm! 




Table 1. Byte Specifications for Write Operations 


The lower two address bits of the device in position zero (as 
determined by the two POSITION inputs) are inhibited; that is, they 
are not stored directly as they are output on the AdrLo bus. Instead, 
on output, the lower two address bits are generated from the indi- 
cation of the positions of the valid data bytes as determined by 
above table. 

MatchOut/Matchin Logic and Read Conflicts 

Whenever the processor references main memory (either a write 
or a read reference), the Write Buffers compare the word address 
from the CPU with the word addresses stored in the buffers. If any 


word address matches, the buffers assert signals that can be used 
by the main memory controller to ensure that the Write Buffer is 
emptied before the read access with the conflicting address has 
been performed. 

Figure 7 illustrates the Write Buffer signals involved in address 
comparison logic. Each write buffer provides four output signals 
(MatchOut A, B, C, and D) which correspond to the four buffer 
ranks (A, B, C, D) in each device as shown in Figure 1. These 
MatchOut signals can be externally NANDed as shown in Figure 7 
to determine if the address being input matches those in any rank 
of the Write Buffer. 


Write Buffer 3 

MatchlnA 

MatchOutA 

MatchlnB 

MatchOutB 

MatchlnC 

MatchOutC 

MatchlnD 

MatchOutD 


Write Buffer 2 

MatchlnA 

MatchOutA 

MatchlnB 

MatchOutB 

MatchlnC 

MatchOutC 

MatchlnD 

MatchOutD 


Write Buffer 1 

MatchlnA 

MatchOutA 

MatchlnB 

MatchOutB 

MatchlnC 

MatchOutC 

MatchlnD 

MatchOutD 


MatchlnA 




MatchlnB 


ilD 3 

tchln 

> 


MatchlnC 


MatchlnD 


Write Buffer 0 


MatchlnA 

MatchlnB 

MatchlnC 

MatchlnD 

MatchOutA 



MciicrivJuio 


MatchOutD 



Oh 


To Main Memory 
Controller 



CONFLICT 


Figure 7. Write Buffer MatchOut/Matchin Logic 
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The outputs of the NAN D gates are fed into Write Buffers via the 
Matchln A, B f C, and D signals and are used within each device as 
part of the byte gathering logic. The NAND gate outputs can be 
NANDed together as shown in Figure 7 with the r esultant signal 
used (in conjunction with the processor’s MEMRD signal) to alert 
the main memory controller logic that there is a pending buffered 
write that conflicts with a just-issued read. The main memory con- 
troller can then delay the read access until the Request signal is 
deasserted indicating that the Write Buffer has been emptied. 


Error Address Latch 

The write buffer incorporates an internal latch that can be loaded 
with one of the buffered addresses and subsequently enabled out 
onto the data lines. This feature can be used by error handling rou- 
tines to read an address back from the Write Buffer and analyze or 
recover from certain bus errors. Figure 8 shows the signals in- 
volved in operation of this latch. 



When the LatchErrAddr signal is asserted, the address currently 
available to the address outputs of the Write Buffer is latched into 
the internal latch. This address can then be output on the DataOut 
lines by asserting the EnErrAdr signal so that the processor can 


read the address in as data. Refer to the AC specifications for 
timing parameters of the signals associated with the error address 
latch. 
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ABSOLUTE MAXIMUM RATINGS 0 ’ 3) 


SYMBOL 

RATING 

COMMERCIAL 

MILITARY 


Vterm 

Terminal Voltage 
with Respect to 
GND 

-0.5 to +7.0 

-0.5 to +7.0 

D 

Ta 

BUM 

0 to +70 

-55 to +125 

H 

Tbias 

Temperature 
Under Bias 

-55 to +125 

“65 to +135 

°c 

Tstg 

Storage 

Temperature* 2 * 

-55 to +125 

-65 to +150 

°c 

Vin 

Input Voltage 

-0.5 to +7.0 

-0.5 to +7.0 

V 


NOTES: 


1. Stresses greater than those listed under ABSOLUTE MAXIMUM RAT- 
INGS may cause permanent damage to the device. This is a stress rating 
only and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended pe- 
riods may affect reliability. 

2. ViN minimum = -3.0V for pulse width less than 1 5ns. Vin should not exceed 
VCC +0.5 Volts. 

3. Not more than one output should be shorted at a time. Duration of the short 
should not exceed 30 seconds. 


RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


GRADE 

AMBIENT 

TEMPERATURE 

GND 

Vcc 

RBH 

-55°C to +125°C 

OV 

5.0 ±10% 

Commercial 

0°C to +70°C 

0V 

5.0 ±5% 


OUTPUT LOADING FOR AC TESTING 



To Device 
Under Test 


DC ELECTRICAL CHARACTERISTICS — 

COMMERCIAL TEMPERATURE RANGE <TA = o°cto+70°c,Vcc = +5.ov±5%) 


SYMBOL 

PARAMETER 

TEST CONDITIONS 

LmMaEEi 

KililUl 

mrm 

Biff ms 

mag 


UNIT 

VOH 

Output HIGH Voltage 

Vcc = Min, Ion = —4mA 

3.5 

— 

3.5 

— 

3.5 — 

3.5 



3.5 



mm 

VOL 

Output LOW Voltage 

Vcc = Min, Iol = 4mA 

— 

0.4 

— 

0.4 

— 0.4 

— 

0.4 

— 

2.4 

mm 

VlH 

Input HIGH Voltage (,) 


2.4 

— 

2.4 

— 

2.4 — 

2.4 



2.4 



mm 

Vil 

Input LOW Voltage* 21 


— 

0.8 

— 

0.8 

— 0.8 

— 

0.8 



0.8 

mm 

ClN 

Input Capacitance 


10 

— 

10 

— 

10 — 

10 

— 

10 

— 

Ol 

COUT 

Output Capacitance 


10 

— ' 

10 

— 

10 — 

10 

— 

10 

— 

mm 

Icc 

Operating Current 

Vcc = Max 

— 

50 

— 60 

— 70 

— 

80 

— 

90 

■n 

IlH 

Input HIGH Leakage 

Vih = Vcc 

— 

10 

— 

10 

— 10 

— 

10 

— 

10 

■? 

IlL 

Input LOW Leakage 

Vil = GND 

-10 

— 

-10 

— 

-10 — 

-10 

— 

-10 

— 

K9 

loz 

Output Tri-state Leakage 

Voh = 2.4V, Vol = 0.5V 

-40 

40 

-40 

40 

-40 40 

-40 

40 

-40 

40 

139 


DC ELECTRICAL CHARACTERISTICS — 

MILITARY TEMPERATURE RANGE (TA = -55°cto+i25°c, vcc = +5.ov±io%) 


SYMBOL 

PARAMETER 

TEST CONDITIONS 

16.67 MHz 
MIN. MAX. 

20.0 MHz 
MIN. MAX. 

KALliB 

33.0 MHz 
MIN. MAX. 

UNIT 

Voh 

Output HIGH Voltage 

Vcc = Min, Ioh = -4mA 

3.5 

_ 

3.5 

_ 

35 

— 

35 — 

V 

Vol 

Output LOW Voltage 

Vcc = Min, Iol = 4mA 

— 

0.4 

— 

0.4 

— 

0.4 

- 0.4 

mm 

Vih 

Input HIGH Voltage*’) 


2.4 

— 

2.4 

2.4 1 

_ 

24 

g§|_ 

mm 

Vil 

Input LOW Voltage* 2 * 


— 

0.8 

~ 0 8 

444-1! 

0.8 

i IIP 

0 8 

mm 

ClN 

Input Capacitance 


10 — : 

10 .ill — " 

10 


10 

— 

mm 

COUT 

Output Capacitance 


: : 10 — 

• 10:4’ 


10 

— 

10 

— 

mm 

Icc 

Operating Current 

Vcc * Max : 

!!. m l 

I 50 " 

— 

60 

— 

70 

— 

80 

IQJH 

IlH 

Input HIGH Leakage 

Vih * Vcc 


10 

— 

10 

— 

10 

— 

10 

mm 

IlL 

Input LOW Leakage 

Vil * GND 

-10 

— 

-10 - 

-10 

— 

-10 

— 

mm 

loz 

Output Tri-state Leakage 

Voh = 2.4V, Vol = 0.5V 

-40 

40 

-40 

40 

-40 

40 

-40 

40 

139 


NOTES: 

1 . V, H should be held above V^ + 0.5 Volts. 

2. Vil Min. = -3.0V for pulse width less than 15ns. Vil should not fall below -0.5 Volts for longer periods. 
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AC ELECTRICAL CHARACTERISTICS — 

COMMERCIAL TEMPERATURE RANGE (T a + 0°cto70°c, v cc = + 5.ov ± 5%) 


SYMBOL 

PARAMETER 


20.0 MHz 
MIN. MAX. 

tl 

Addrin (3:0) to Clock falling setup 

8 

— 

7 

— 

t2 

Addrin (3:0) from Clock falling hold 

4 

— 

4 

— 

t3 

Address 1 :0 to Clock falling setup 

8 

— 

7 

— 

t4 

Address 1 :0 from Clock falling hold 

4 

— 

4 

— 

t5 

Access Type 1 :0 to Clock rising setup 

7 

— 

6 

— 

t6 

Access Type 1 :0 from Clock rising hold 

3 

— 

3 

— 

t7 

Addrin (7:4) to Clock rising setup 

7 

— 

5 

— 

t8 

Addrin (7:4) from Clock rising hold 

3 

— 

3 

— 

t9 

Dataln (8:0) to Clock rising setup 

7 

— 

5 

— 

tio 

Dataln (8:0) from Clock rising hold 

3 

— 

3 

— 

til 

WrtMem to Clock rising setup 

10 

— 

8 

— 

tl 2 

WrtMem from Clock rising hold 

6 

— 

5 

— 

tl 3 

Request from Clock rising 

— 

32 

— 

30 

tl 4 

Acknowledge to Clock rising setup 

12 

— 

11 

— 

tl 5 

Acknowledge from Clock rising hold 

7 

— 

6 

— 

tl 6 

LatchErrAdr to Acknowledge rising 

5 

— 

5 

— 

tl 7 

WbFull active from Clock rising 

— 

21 

— 

19 

tie 

WbFull inactive from Clock rising 

— 

21 

— 

19 

tl 9 

OutEn to AddrOut (7:0), DataOut (8:0) valid 

2 

15 

2 

15 

t20 

OutEn to AddrOut (7:0), DataOut (8:0) tri-state 

2 

15 

2 

15 

t21 

MatchOut (ABCD) from Clock rising 

— 

24 

— 22 

t22 

Matchln (ABCD) to Clock rising setup 

10 

— 

9 

— 

t23 

Matchln (ABCD) from Clock rising hold 

3 

— 

3 

— 

t24 

EnErrAdrto Data (error latch) valid 

2 

15 

2 

15 

t25 

BOBBaiSHl 

2 

15 

2 

15 

t26 

Address/Data out from Clock rising 

— 

30 

— 

27 

t27 

Reset to Clock rising, set-up 

10 

— 

10 

— 

t28 

Reset from Clock rising, hold 

3 

— 

2 — 

t29 

Reset low pulse width 

8 

— 

8 

— 

t30 

WbFull High from Clock rising (after Reset) 

— 

22 

— 

21 

t31 

Request High from Reset low 

— 

20 

— 

19 

t32 

Access TypOut 1 :0 low from Reset low 

— 

28 

— 

26 

t33 

Match Out (ABCD) Low from Reset low 

— 

21 

— 

20 

t34 

Address/Data out tri-state from Reset low 
(OutEn negated) 

— 

32 

— 

30 

t35 

Access TypeOut from Clock rising 

— 

32 

— 30 

tcyc 

Clock Pulse Width 

60 

2000 

50 

2000 

tckhigh 

Clock High Pulse Width 

24 

— 

20 

— 

tcklow 

Clock Low Pulse Width 

24 

— 

20 — 



— 22 



— 

5 

— 

5 

— 

— 

17 

— 11 

2 

15 

2 

15 

— 

20 

8 — 

3 

— 

2 

15 


2 

15 

— 

24 

10 

— 

1 

— 

8 

— 

— 

20 

— 

18 

— 

25 

— 20 

— 27 

— 

27 

40 

2000 

16 

— 

16 

— 



— 11 
— 16 

— 23 

— 15 

— 23 

— 23 
30 2000 
12 — 
12 — 



— 16 ns 

8 — ns 

1 — ns 

8 — cycles 

— 11 ns 

— 16 ns 

— 23 ns 

— 15 ns 

— 23 ns 

— 23 ns 

25 2000 ns 

10 — ns 

10 — ns 
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AC ELECTRICAL CHARACTERISTICS — 

MILITARY TEMPERATURE RANGE (Ta + -55°c to i 25 °c, Vcc = +5.ov± 10%) 


SYMBOL 

PARAMETER 

16.67 MHz 
MIN. MAX. 

20.0 MHz 
MIN. MAX. 

25.0 MHz 
MIN. MAX. 

33.0 MHz 
MIN. MAX. 

UNIT 

tl 

Addrin (3:0) to Clock falling setup 

8 

— 

7 

— 

6 

— 

3 

— 

ns 

t2 

Addrin (3:0) from Clock falling hold 

4 

— 

4 

— 

4 1| 

3 

— 

ns 

t3 

Address 1 :0 to Clock falling setup 

8 

— 

7 

— 

6 f: 

3 

: — 

ns 

t4 

Address 1 :0 from Clock falling hold 

4 

— 

4 

— 

4 — 

■mg 

— 

ns 

t5 

Access Type 1 :0 to Clock rising setup 

7 

— 

6 

— 

5 v : ; 

— 

4 

— 

ns 

t6 

Access Type 1 :0 from Clock rising hold 

3 

— 

3 

— 

2 

— 

2 

— 

ns 

t7 

Addrin (7:4) to Clock rising setup 

7 

— 

5 

— 

4 — 

4 

— 

ns 

t8 

Addrin (7:4) from Clock rising hold 

3 

— 

3 

— 


ip 



ns 

t9 

Dataln (8:0) to Clock rising setup 

7 

— 

5 

— 

4 

4 

— 

ns 

tio 

Dataln (8:0) from Clock rising hold 

3 

— 

3 

— 

2 — 

1 — 

ns 

til 

WrtMem to Clock rising setup 

10 

— 

8 

— 

7 — 

6 

— 

ns 

tl 2 

WrtMem from Clock rising hold 

6 

— 

5 

— | 

I'i; 4 

— 

3 

— 

ns 

tl 3 

Request from Clock rising 

— 

32 

— 

30 

— 

22 : 


16 

ns 

tl 4 

Acknowledge to Clock rising setup 

12 

— 

11 

— | 

6 — 

4 

— 

ns 

tl 5 

Acknowledge from Clock rising hold 

7 

— 

6 

: : 5 


3 

— 

ns 

tl 6 

LatchErrAdr to Acknowledge rising 

5 

— 

5 

-||| 

5 

... — 

3 

— 

ns 

tl 7 

WbFull active from Clock rising 

— 

21 

— 

1 9 : ' | 

' - 17 

— 

9 

ns 

tl 8 

WbFull inactive from Clock rising 

— 21 

— 

19'; 

— : 11 

— 

9 

ns 

tl 9 

OutEn to AddrOut (7:0), DataOut (8:0) valid 

2 

15 

2 

is: j 

2 

15 

2 

12 

ns 

t20 

OutEn to AddrOut (7:0), DataOut (8:0) tri-state 

2 

15 

2 

15 

2 

: 15 

2 

12 

ns 

t21 

MatchOut (ABCD) from Clock rising 

— 

24 

— 

22: 

— 20 

— 

15 

ns 

t22 

Matchln (ABCD) to Clock rising setup 

10 

— 

9 

8 

— . 

5 

— 

ns 

t23 

Matchln (ABCD) from Clock rising hold 

3 

3 

3 

— 

3 

— 

ns 

t24 

EnErrAdr to Data (error latch) valid 

2 

15 

2 

15 

: 2 

15 

2 

15 

ns 

t25 

EnErrAdrto Data (error latch) tri-state 

2 

15 

2 

15 

2 

15 

2 

15 

ns 

t26 

Address/Data out from Clock rising 

— 

30 

— 

'Mth;: 

— 24 

— 

16 

ns 

t27 

Reset to Clock rising, set-up 

10 

— 

10 


10 

— 

8 

— 

ns 

t28 

Reset from Clock rising, hold 

3 

— 

2 — 

1 


1 

— 

ns 

t29 

Reset low pulse width 

8 

— 

8 llpf 

8 

If — 

8 

— 

cycles 

t30 

WbFull High from Clock rising (after Reset) 

— 

22 

— ::: 

21 

W~%:§§ 20 

— 

11 

ns 

t31 

Request High from Reset low 

— 

20 

— J§ 

19 

WmM is 


16 

ns 

t32 

Access TypOut 1 :0 low from Reset low 

— 

28 

— 

26 

■£* 25 

— 

23 

ns 

t33 

Match Out (ABCD) Low from Reset low 

— 21 

— : : 20 

— 20 

— 

15 

ns 

t34 

Address/Data out tri-state from Reset low 
(OutEn negated) 

— 

32 

• : 

30 

— 27 

— 

23 

ns 

t35 

Access TypeOut from Clock rising 

— 

32 

— 

30 

— 

27 

— 

23 

ns 

tcyc 

Clock Pulse Width 

60 

2000 

50 1 

2000 

40; 

2000 

30 

2000 

ns 

tckhigh 

Clock High Pulse Width 

24 

— 

20 


16 

— 

12 — 

ns 

tcklow 

Clock Low Pulse Width 

24 

— 

20 

— 

16 

— 

12 


ns 
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Figure 9. Write Buffer Timing Specifications 


Clock 


WTMEM 


REQUEST 


WbFull 



Acknowledge 


/\ 


Figure 10. WBFULL Signal Timing Specifications 
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OutEn 

DataOut 

AddrOut 


y 


M 


Figure'll. OUTEN Timing Specifications 


Clock 

WTMEM 

MatchOut 

(ABCD) 

Match In 

(ABCD) 

EnErrAdr 

DataOut 







Clock \ i 

' V 

J \ 


* l 26 


Address/Data X 

> 

c 

RFOIIFRT 



AcknnwIoHqo , / 

"\ 





AccTypOut X 

. . 1 

i 

( 


** t35‘ * 



Figure 13. Address/Data Out, Access Type Out 
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68-PIN CPGA FOR R3020 

PIN GRID ARRAY (CERAMIC) - BOTTOM VIEW 


ACC- 

TYPEO 


AC- 

KNOWL- 

EDGE 


AD- 

IDRESS1 


CLOCK 


DATA- 

INO 


DATA- 

IN2 


DATA- 

IN4 


DATA- 

INS 


VCC2 


GND1 


VCC1 


ACC- 
TYPE 1 


AD- 

jDRESSO 


BIG- 


EN- 


ENDIAN 


ERROR- 

ADR 


DATA- 

IN1 


DATA- 

INS 


DATA- 

INS 


GND2 


DATA- 

IN? 


ADDR- 

OUT5 


ADDR- 

OUT4 


DATA- 

INS 


ADDR- 

INO 


ADDR- 

OUT3 


ADDR- 

OUT2 


ADDR- 

IN1 


ADDR- 

IN2 


ADDR- 

OUT1 


ADDR- 

OUTO 


ADDR- 

IN3 


ADDR- 

IN4 


DATA- 

OUTS 


DATA- 

OUTO 


ADDR- 

IN5 


ADDR- 

IN6 


DATA- 

OUT1 


DATA- 

OUTS 


ADDR- 

IN7 


LATCH- 

ERR- 

ADR 


DATA- 

OUTS 


ADDR- 

OUT6 


MATCH- 

INA 


MATCH 

INB 


AD DR- 
OUT? 


ACC- 

TYPE 

OUT1 


MATCH- 

ING 


MATCH 

IND 


ACC- 

TYPE 

OUTO 


GNDO 


DATA- 

OUT? 


DATA- 

OUT4 


RE- 


QUEST 


MATCH- 

OUTC 


MATCH-1 

OUTA 


RESET 


PO- 

SITIONO 


VCC3 


GND3 


VCCO 


DATA- 

OUTS 


DATA- 

OUTS 


WBFULL 


MATCH- 

OUTD 


MATCH- 

OUTB 


WTMEM 


PO- 
SITION 1 


OUTEN 


10 
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PIN CONFIGURATION 
PLASTIC LEADED CHIP CARRIER 


(TOP VIEW) 


Q O CQ < 

» n o , t hr hr hr 

P- i— i — i— h r r □ a 
D 3 D 3 |J(» 0 0 0 0 

o<<<<p^pppo 


0 <<<<U 2 LU<<<< hr 


> Q Q Q D 15 III 5 


5 |5 lor o- o. lo > 


VSSO 

ACCTYPEOUTO 

ACCTYPE0UT1 

ADDROUT7 

ADDR0UT6 

DATAOUT3 

DATAOUT2 

DATAOUT1 

DATAOUTO 

DATAOUT8 

ADDROUTO 

ADDR0UT1 

ADDROUT2 

ADDROUT3 

ADDR0UT4 

ADDROUT5 

VSS1 



S5 « 1 
J3 a q 

cc tr z 
O Q ft 
o q o 

< < CD 


Z^IO’-ND'tiniOlM 

<0022225220 

Q0<<<<<<<<O 

u]Oo<<<<<<< 

1 


VSS3 

MATCHIND 

MATCHINC 

MATCHINB 

MATCHINA 

LATCHERRADR 

ADDRIN7 

ADDRIN6 

ADDRIN5 

ADDRIN4 

ADDRIN3 

ADDRIN2 

ADDRIN1 

ADDRINO 

DATAIN8 

DATAIN7 

VSS2 


ORDERING INFORMATION 
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IDT73200, IDT73201 

16-BIT CMOS MULTILEVEL PIPELINE REGISTERS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN CONFIGURATIONS 




NC 

GND 

Y4 

Ys 

Ye 

Y7 

Vcc 

GND 

Ya 

Yg 

Yio 

Yu 

GND 


PLCC 
TOP VIEW 


PIN DESCRIPTIONS 


Pin Name 

I/O 

Description 

Do- Di5 

1 

Sixteen-bit data input port. 

Yo-Yis 

O 

Sixteen-bit data output port. 

lo — 13 

1 

Four control pins to select the register operation performed. 

SEL 0 -SEL 2 

1 

Three control pins to select the register appearing at the output. 

CLK 

1 

Clock input. 

CEN 

■ 

Clock enable control pin. When this pin is low, the instruction I 0 -I 3 is performed on the registers. 

When high, no register operation occurs. 

OE 

■ 

Output enable control pin. When this pin is high, the output port Y is in a high impedance state. 

When low, the output port Y is active. 

Vcc 


Power supply pin, 5V. 

GND 


Ground pins, 0V. 


2562 tbl 01 


IDT73200 OUTPUT SELECTION 


SEL 2 

SELi 

SELo 

Y Output 

0 

0 

0 

A -» Yo - Y 15 

0 

0 

1 

B-»Yo- Yis 

0 

1 

0 

C -» Yo- Y 15 

0 

1 

1 

D -» Yo - Y 15 

1 

0 

0 

E-» Yo- Y 15 

1 

0 

1 

F -* Yo- Yis 

1 

1 

0 

G -> Yo - Y 15 

1 

1 

1 

H -> Yo - Yis 


~ g 5 52TpT S? 


IDT73201 OUTPUT SELECTION 


SEL2 

SELi 

SELo 

Y Output 

0 

0 

0 

A -» Yo- Yis 

0 

0 

1 

B -» Yo- Yis 

0 

1 

0 

C-» Yo-Yis 

0 

1 

1 

D -> Yo- Yis 

1 

0 

m 

E -» Yo- Y 15 

1 

0 

1 

F -> Yo- Yis 

1 

1 

0 

G -> Yo- Yis 

1 

1 

1 

Do - Dis — > Yo — Yis 


■25B2T5TC3 
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IDT73200, IDT73201 

16-BIT CMOS MULTILEVEL PIPEUNE REGISTERS MILITARY AND COMMERCIAL TEMPERATURE RANGES 


IDT73200 INSTRUCTION TABLE 


D 

MM 

mm 

mm 

Mnemonic 

Function 

Pipeline Levels 

0 

D 

0 

0 

LDA 

Do - Di5 -» A 

1 

0 

wm 

0 

1 

LDB 

Do - Dis -» B 

1 

0 

0 

i 

0 

LDC 

Do - Dis -» C 

1 

0 

0 

Ik 

n 

LDD 

Do - Dis -> D 

1 

0 

1 

0 

0 

LDE 

Do - Dis -> E 

1 

MM 

n 

0 

1 

LDF 

Do - Dis -» F 

1 

0 

i 

1 

0 

LOG 

Do - Dis -» G 

1 

D 

i 

1 

1 

LDH 

Do — Dis — > H 

1 

1 

0 

mm 

0 

LSHAH 

Do "" Dis — A — ^ B — ^ C — ) D •*) E F G — > H 

8 

n 

wm 

0 

1 

LSHAD 

Do - Dis — > A — > B — > C — > D 

4 

1 

0 

i 

0 

LSHEH 

Do - Dis -» E -» F -» G -> H 

4 

n 

0 

i 

1 

LSHAB 

Do - Dis -» A B 

2 

i 

1 

0 

0 

LSHCD 

Do — D 1 5 — > G ■ — > D 

2 

i 

1 

0 

1 

LSHEF 

Do - Dis -> E -> F 

2 

i 

1 

1 

0 

LSHGH 

Do - Dis -» G -» H 

2 

i 

1 

1 

1 

HOLD 

Hold All Registers 

— 
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IDT73201 INSTRUCTION TABLE 


El 

El 

mm 

wm 

Mnemonic 

Function 

Pipeline Levels 

0 

0 

0 

0 

LDA 

Do — Di5 — > A 

1 

0 

0 

0 

1 

LDB 

Do - Dis B 

.1 

0 

0 

1 

0 

LDC 

Do — Di5 — > G 

1 

El 

El 

1 

1 

LDD 

Do - Dis -) D 

1 

0 

1 

0 

0 

LDE 

Do-Di5->E 

1 

0 

1 

0 

1 

LDF 

Do - Dis -> F 

1 

0 

1 

1 

El 

LDG 

Do - Dis G 

1 

0 

1 

1 

1 

HOLD 

Hold All Registers 

— 

1 

0 . 

0 

0 

LSHAG 

Do — Di 5 — > A — > B — > G — > D — * E *4 F — > G 

7 

1 

El 

0 

1 

LSHAD 

Do - Dis — > A — > B — > C — > D 

4 

1 

0 

1 

0 

LSHEG 

Do - Dis -» E -* F G 

3 

a 

0 

1 

1 

LSHAB 

Do - Dis — > A — > B 

2 

mm 

1 

0 

0 

LSHCD 

Do - Dis -> C -> D 

2 

i 

1 

0 

1 

LSHEF 

Do - Dis -> E -> F 

2 

i 

1 

1 

0 

LDG 

Do - Dis -> G 

1 

i 

1 

1 

1 

HOLD 

Hold All Registers 

— 
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IDT73200, IDT73201 

16-BIT CMOS MULTILEVEL PIPEUNE REGISTERS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


IDT73200 PIPELINE CONFIGURATIONS IDT73201 PIPELINE CONFIGURATIONS 


Eight 1 -Level 

| UP | 1 = 4 


ia i r 

E I 

i i=i 

4 '-5 

i bi r 

1 


l'-2 

4 1 = 6 

i ci r 

G 1 

1 1 = 3 

4 1-7 

r di r 

H 1 

Two 4-Level 

1 1 = 10 




mil 

m 



c 

mm 

HI 

m 

1 D 1 L 

_STl 


Four 2-Level 

1 1- 11 | 1 = 13 


1 A | I E | 




1 B M F | 

4-1-12 4 1 = 14 

1 C 1 1 G 1 




1 D 1 1 H | 

One 8-Level 

4 1-8 ; 

4 t 

iSKHii 

E 





■9 

II F 1 

jj 1 




■mi 

II G | 





1 D || 

rzH-j 


Seven 1 -Level 



a — i r 

E 1 


4 '- 1 . 

| 1-5 

□ 

B 1 1 

F | 


4' = 2 

| 1 = 6, 

□ 

C 1 1 

G | 


4 1 = 3 


r 

D | 



One 4-Level, One 3-Level 
^1-9 | l-io 


AZD I E | 

jlZI i f i 

cZJ 1 G _ I 

t] 


Three 2-Level 


1 A | | E | 

1 



1 B | I F | 

4 1-12 


1 C 1 



1 D 1 

One 7-Level 

4 <-e 

4 

LA 1 

1 E | 

i 




Lb 

II F | 





L cl 

1 G | 



2562drw04 
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ABSOLUTE MAXIMUM RATINGS (1) 


Symbol 

Rating 

Commercial 

Military 

E233 

Vcc 

Power Supply 
Voltage 

-0.5 to +7.0 

-0.5 to +7.0 

H 

VTERM 

Terminal Voltage 
with Respect 
to GND 

-0.5 to 

VCC + 0.5 

-0.5 to 

Vcc + 0.5 


Ta 

Operating 

Temperature 

0 to +70 

-55 to +125 

■ 

m 

Temperature 

Under Bias 

-55 to +125 

—65 to +135 

■ 

m 

Storage 

Temperature 

-55 to +125 

-65 to +155 

H 

m 

mm 

50 

50 



NOTE: 2562 ty 06 


1. Stresses greater than those listed under ABSOLUTE MAXIMUM 
RATINGS may cause permanent damage to the device. This is a stress 
rating only and functional operation of the device at these or any other 
conditions above those indicated in the operational sections of this 
specification is not implied. Exposure to absolute maximum rating 
conditions for extended periods of time may affect reliability. 


CAPACITANCE (TA = +25°C, F = 1.0MHz) 


Symbol 

Parameter* 1 * 

Conditions | 

1 J yp- 1 

m 

ClN 

Input Capacitance 

> 

o 

II 

2 

> 

mm 

■a 

Coirr 

Output Capacitance 

VOUT = 0V | 

i 12 i 

E3 


NOTE: 2562 tbt 07 

1. This parameter is sampled at initial characterization and is not 100% 


6 


TEST CIRCUIT 


Test 

Switch 

tPLZ 

Closed 

tPZL 

Closed 

Open Drain 

Closed 

All Other Tests 

Open 

DEFINITIONS: 

2562 tbl 10 


Cl = Load capacitance includes jig and probe capacitance. 

Rt = Termination should be equal to Zout of the pulse generator. 

(Typically 500) 

Vin = 0V to 3.0V 

INPUT: tr = tt = 2.5ns (10% to 90%) unless otherwise specified 
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IDT73200, IDT73201 

16-BIT CMOS MULTILEVEL PIPEUNE REGISTERS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DC ELECTRICAL CHARACTERISTICS 

Commercial: 0°C to +70°C, 5V ± 5%; Military: -55°C to +125°C, 5V ± 10% 


Symbol 

Parameter 

Test Condition 

Min. 

Max 

UQU 

VlH 

High-Level Input Voltage 

— 


— 

KM 

VlL 

Low-Level Input Voltage 

— 

— 

0.8 

V 

IlH 

High Level Input Current 

Vcc - Max. 

Vl- Vcc 

— 

10 

pA 

IlL 

Low-Level Input Current 

Vcc - Max. 

Vl - GND 

— 

-10 

pA 

VOH 

High-Level Output Voltage 

Vcc - Min., 

k)H - -8mA(COM’L.), -6mA(MIL) 

2.4 


V 

VOL 

Low-Level Output Voltage 

Vcc « Min., 

kx.- 16mA(COM’L.), 12mA(MIL.) 



V 

VlK 

Input Clamp Voltage 

ll--18mA 

— 

-1.2 

V 

im 


VCC - Max., VO = GNU 

Vi - Vcc or GND 




lOZH 

High Impedance Output 
Current 

Vcc - Max. 

Vl - Vcc 


20 

pA 

lOZL 

Low Impedance Output 
Current 

Vcc - Max. 

Vl - GND ' 


-20 

pA 


NOTES: 2562 woe 

1. For conditions shown as Min. or Max., use appropriate value based on temperature range. 

2. Not more than one output should be shorted at one time. Duration of the short circuit test should not exceed 1 00 milliseconds. 


POWER SUPPLY CHARACTERISTICS 


Symbol 

Parameter 

Test Conditions* 1 * 

Min. 

Typ- (2> 

Max. 

Unit 

Iccoc 

Quiescent Power Supply Current 

Vcc = Max. 

Vi = Vlc or Vhc 

— 

2 

10 

mA 

Iccqt (3) 

Quiescent Power Supply Current 
Inputs HIGH 

Vcc = Max. 

Vl = 3.4V 

— 

15 

45 

mA 

■ 

Dynamic Power Supply Current 

Vcc = Max. 

Outputs Disabled, OE - HIGH 
fCP = 10MHz, 50% Duty Cycle 
Vi < Vhc, Vi > Vlc 

COM’L. 

— 

10 

30 

mA 

MIL 

— 

10 

40 


Dynamic Power Supply Current 

Vcc = Max. 

Outputs Disabled, OE = HIGH 
fcp = 40MHz, 50% Duty Cycle 
Vi < Vhc, Vi > Vlc 

COM’L. 

— 

10 

60 

mA 

MIL. 

— 

10 

80 


NOTES: 2562 tw 09 

1. For conditions shown as Min. or Min., use appropriate value specified under Electrical Characteristics for the applicable device type. 

2. Typical values are at Vcc = 5.0V, +25°C ambient and maximum loading, not production tested. 

3. This parameter is not directly testable but is derived for use in the total power supply calculation. 

4. Ic = Iquiescent + Inputs + Idynamic 
Ic = ICCQC + (ICCQT X Dm X NT) + ICCO 
Iccoc = Quiescent Current 

Iccqt = Power Supply Current for a TTL High Input (Vin = 3.4V) 

Dh = Duty Cycle for each TTL Input High 
Nt = Number of TTL Inputs at Dh 

Iccd = Dynamic Charge moved by an input transition pair (HLH or LHL) 

All currents are in milliamps and all frequencies are in megahertz. 
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IDT73200, IDT73201 

16-BIT CMOS MULTILEVEL PIPELINE REGISTERS 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


AC ELECTRICAL CHARACTERISTICS 

Commerical: Ta = 0°C to +70°C, Vcc = 5V ±5%; Militaty: Ta = -55°C to +125°C, Vcc = 5V ±10% 


Parameter 

Commercial 

Military 

Unit 

73200L12 
73201 LI 2 

73200L15 
73201 LI 5 

73200L15 
73201 LI 5 

73200L20 
73201 L20 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

CLKto Y 0 -Y 15 Propagation Delay 

— 

12 

— 

15 

— 

15 

— 

20 

ns 

SELo-SEL 2 to Y 0 -Y 15 Propagation Delay 

— 

12 

— 

15 

— 

15 

— 

20 

ns 

Do-Dis to CLK Set-up Time 

3 

— 

4 

— 

4 

— 

5 

— 

ns 

Do-Di5 to CLK Hold Time 

1 

— 

2 

— 

2 

— 

3 

— 

ns 

lo— 13 to CLK Set-up Time 

4 

— 

5 

— 

5 

— 

6 

— 

ns 

lo— 13 to CLK Hold Time 

2 

— 

2 

— 

2 

— 

3 

— 

ns 

CEN to CLK Set-up Time 

4 

— 

5 

— 

5 

— 

6 

— 

ns 

CEN to CLK Hold Time 

2 

— 

2 

— 

2 

— 

3 

— 

ns 

OE Enable Time* 1 * 

— 

9 

— 

10 

— 

10 

— 

13 

ns 

OE Disable Time* 1 * 

— 

8 

— 

9 

— 

9 

— 

13 

ns 

CLK Pulse Width HIGH 

5 

— 

5 

— 

5 

— 

6 

— 

ns 

CLK Pulse Width LOW 

5 

— 

5 

— 

5 

— 

6 

— 

ns 

CLK Period 

— 

12 

— 

15 

— 

15 

— 

20 

ns 

Data In to Data Out Flowthrough* 2 * 

— 

12 

— 

15 

— 

-15 

— 

20 

ns 


NOTES: 2562 tun 

1 . Output Enable and Disable times measured to 500mV change of output voltage level. 

2. 73201 only. 


AC TEST CONDITIONS 


Input Pulse Levels 

GND to 4.0V 

Input Rise/Fall Times 

4ns 

Input Timing Reference Levels 

1.5 V 

Output Reference Levels 

1.5 V 

Output Load 

See Figure 1 


2562 tbl 12 


CMOS TESTING CONSIDERATIONS 

There are certain testing considerations which must be 
taken into account when testing high-speed CMOS devices in 
an automatic environment. These are: 

1 ) Proper decoupling at the test head is necessary. Place- 
ment of the capacitor set and the value of capacitors used 
is critical in reducing the potential erroneous failures 
resulting from large Vcc current changes. Capacitor lead 
length must be short and as close to the DUT power pins 
as possible. 

2) All input pins should be connected to a voltage potential 
during testing. If left floating, the device may begin to 
oscilliate causing improperdevice operation and possible 
latchup. 



3) Definition of input levels is very important. Since many 
inputs may change coincidentally, significant noise at 
the device pins may cause the Vil and Vih levels not to be 
met until the noise has settled. To allow for this testing/ 
board induced noise, IDT recommends using Vn<0Vand 
Vih > 3V for AC tests. 

4) Device grounding is extremely important for proper de- 
vice testing. The use of multi-layer performance boards 
with radial decoupling between power and ground planes 
is required. The ground plane must be sustained fromthe 
performance board to the DUT interface board. All 
unused interconnect pins must be properly connected to 
the ground pin. Heavy gauge stranded wire should be 
used for powerwiring and twisted pairs are recommended 
to minimize inductance. 
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IDT73200, IDT73201 

16-BIT CMOS MULTILEVEL PIPEUNE REGISTERS MILITARY AND COMMERCIAL TEMPERATURE RANGES 


ORDERING INFORMATION 


IDT 


XXXXX XX XX X 


X 


Device Power Speed Package Process/ 



Blank 

B 


12 1 
15 J 


73200 

73201 


Commercial (0°C to +70°C) 

Military (-55°C to+125°C) 

Compliant to MIL-STD-883, Class B 

Plastic DIP 

Sidebraze DIP 

Plastic Leaded Chip Carrier 


Speed in Nanoseconds 


Low Power 

16-Bit 8-Level Pipeline Register 
16-Bit 7-Level Pipeline Register 


2562 drw 06 
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FAST CMOS OCTAL 
REGISTER TRANSCEIVER 
WITH PARITY 


PRELIMINARY 

IDT73210 

IDT73211 


FEATURES: 

• Two bidirectional interfacing ports 

• Single-level pipeline register for one port and one-level 
(73211) or two-level (73210) pipeline register for the 
other port 

• 8-bit wide interface ports plus parity bit 

• Even parity checking in both directions 

• Even/odd parity generation from Port A to Port B 


Even parity generation from Port B to Port A 
Parity polarity control 

High output drive capability: 64/48mA (commercial/ 
military) 

Available in 32-pin, 300 mil plastic DIP and sidebraze 
DIP, surface mount 32-pin SOJ and LCC packages 
High-speed, low-power, CEMOS™ process technology 
Military product compliant to MIL-STD-883, Class B 



MILITARY AND COMMERCIAL TEMPERATURE RANGES 


DECEMBER 1990 


©1990 Integrated Device Technology, Inc. 
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IDT73210, IDT7321 1 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY MILITARY AND COMMERCIAL TEMPERATURE RANGES 


APPLICATIONS 

• Cache memory bus interface 

• Read and write buffers for RISC microprocessor system 

• Registered transceiver with parity 

FUNCTIONAL DESCRIPTION 

The IDT73210/1 Octal Register Transceivers are high- 
speed, low-power data interface with data integrity checking 
capability. 

They are designed for high-performance systems requiring 
bidirectional data transfer between two buses and maintain- 
ing error checking via parity. 

In any RISC or CISC microprocessor system, the 
I DT7321 0/1 can be used to interface cache memory with main 
memory. Data integrity is ensured through parity checking. 
Control features allow dynamic reconfiguration of 
check/generate and odd/even parity options. 

Detailed Functional Description 

Port A to Port B Path (IDT73210 and IDT73211) is 

comprised of a register (X), an even/odd parity generator and 
an ev en pa rity checker. The input data is on the Ao-8 lines. 
When AEN is low, Ao-8 is latched into Register X on the low- 
to-high CP transition. Even parity of the latched data is 
checked. If PERRA goes high, a parity error has occurred. A 
new parity bit, Bs, is gener ated. The output data bus is Bo-8 
and is enabled when BOE is low. 

Port B to Port A Path (IDT73210) is comprised of a latch 
(W), two registers (Y and Z), an even parity generator/checker 
and a parity bit latch complementer. The input data bus is on 
the Bo- 8 lines. 

When SEL is high, the incoming data is latched into Latch 
W. When LE is high, Latch W is transparent; when LE is low, 
Latch W is closed. The parity bit, Bs, can be complemented 
by the POLARITY pin. If POLARITY is low, the parity sense 
remains the same. If POLARITY is high, the parity sense is 
complemented. Parity is not generated in this path. Even 
parity of latched data is chec ked. If PERRB goes high, aparity 
error has occurred. When BEN is low, Wo-8 is latched into 
Register Z on the low-to-high C P tra nsition. The previous 
contents are held in Register Z if BEN is high or if there is no 


low-to-high C P tran sition. The output data bus is Ao-8 and is 
enabled when AOE is tow. When SEL is high, there is only a 
one clock cycle latency. 

When SEL is tow, the incoming data is la tched into Register 
Y on the low-to-high CP transition, when BEN is tow. Even 
parity of the registered data is checked. If PERRB goes high, 
a parity e rror has occurred. Even parity (QYs) is generated on 
the contents in Register Y. When BEN is low, the contents of 
register Y are tra nsferr ed to Register Z on the low-to-high CP 
transition. When BOE is tow, the content of Register Z is made 
available at output Port A. When SEL is low, there is a two 
clock cycle latency. 

Port B to Port A Path (IDT73211) is comprised of a latch 
(W), two registers (Y and Z), an even parity generator/checker 
and a parity bit latch complementer. The input data bus is on 
the Bo-8 lines. 

When SEL is high, the incoming data is latched into Latch 
W. When LE is high, Latch W is transparent; when LE is tow, 
Latch W is closed. The parity bit, Bs, can be complemented 
by the POLARITY pin. If POLARITY is tow, the parity sense 
remains the same. If POLARITY is high, the parity sense is 
complemented. Parity is not generated in this path. Even 
parity of latched data is chec ked. If PERRB goes high, a parity 
error has occurred. When BEN is tow, Wo-8 is latched into 
Register Z on the low-to-high C P tra nsition. The previous 
contents are held in Register Z if BEN is high or if there is no 
low-to-high C P tran sition. The output data bus is Ao-8 and is 
enabled when AOE is tow. When SEL is high, there is only a 
one clock cycle latency. 

When SEL is tow, the incoming data is latched into Latch Y 
when LE is high. Latch Y is closed when LE is tow. Even parity 
of latched data is checked. If PERRB goes high, a parity error 
has occurred. Eve n parit y (QYe) is generated on the contents 
in Latch Y. When BEN is tow, the contents of Latch Y are 
transf erred to Register Z on the low-to-high CP transition. 
When BOE is tow, the content of Register Z is made available 
at output Port A. When SEL is tow, there is a one clock cycle 
latency. 

The power pins are Vcc and GND 0 - 2 . GNDo is internal quiet 
ground, GNDi is Port B ground and GND 2 is Port A ground. 
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IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


PIN CONFIGURATIONS* 1 * 



DIP/SOJ 
TOP VIEW 


SEL 

AOE 

Ao 

Ai 

A2 

A3 

A4 

Vex: 

GND 2 

As 

Ae 

A7 

As 

PERRA 

AEN 

POLARITY 


2504 drw 02 


NOTE: 

1 . GNDo is internal quiet ground 
GNDi is B Port ground 
GND 2 is A Port ground 


PIN DESCRIPTIONS 


Pin Name 

I/O 

Description 

AO-8 

I/O 

Data Port A. 

AEN 

1 

Clock enable (active low) for the register X. 

AOE 

1 

3-state output enable for Port A. 

Bo-8 

I/O 

Data Port B. 

BEN 

1 

Clock enable (active low) for the registers Y and Z. 

BOE 

1 

3-state output enable for Port B. 

LE 

1 

Latch enable input for Latch Y/Latch W of Port B. The Latch Y/Latch W is open when LE is high. Data is latched 
on the high-to-low transition of LE. 

SEL 

1 

Input selection for Port B. 

SEL - 0 Register Y; SEL = 1 Latch Y 

POLARITY 

1 

Polarity selection input. 

Polarity A to B Direction B to A Direction 

0 EVEN Pass Parity 

1 ODD Complement Parity 

PERRA 

□ 

Parity output error for Port A. 

PERRB 

o 

Parity output error for Port B. 

CP 

1 

Input clock. 

Vcc 


+5 volts. 

GNDo-2 


Ground. 


2594 tbl 01 


6.6 


3 



























IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


OPERATING MODES SUMMARY 
IDT73210/1 A TO B DIRECTION 


Input 

Reg. X 

PERRA 

Output 

(Bs) Bo-8 

AO-8 

Ao-a -> QXo- 8 
(CP = Lo to Hi) 
(AEN = 0) 

Result of even 
parity check 

Even/odd parity bit 

Ba - POLARITY XOR 
Even parity generate 
from QXo-7 

QXo-e -» Bo-8 
(BOE = 0) 

2594 t^l 02 

IDT73210/1 B TO A DIRECTION WHEN SEL = 1 


Input 

Latch W 

PERRB 

Reg. Z 

(QZs) QZo-a 

Output 

(A8) A 0—8 

Bo-a 

Bo-a -> Wo-8 
(LE-1) 

Result of even 
parity check 

Bit complemented 
by POLARITY 
(Even/odd parity 
translation) 

Wo-a -> QZo-a 
(CP - Loto Hi) 
(BEN - 0) 

Aa - POLARITY XOR 
Wa 

QZo-a -» Ao-8 
(AOE - 0) 


2594 tbl 03 


IDT73210 B TO A DIRECTION WHEN SEL = 0 


Input 

Reg. Y 

PERRB 

Reg.Z 

(QZa) QZo-a 

Output 

(A8) A 0—8 

Bo-a 

Bo-8 -» QYo-a 
(CP = Lo to Hi) 
(BEN = 0) 

Result of even 
parity check 

Even parity generated 
bit 

QYo-a -> QZo-a 
(CP = Lo to Hi) 
(BEN = 0) 

As = Even parity 
generated from QYo-7 

QZo-a -> Ao-8 
(BOE = 0) 


2594 tbl 04 


IDT73211 B TO A DIRECTION WHEN SEL = 0 


Input 

Latch Y 

PERRB 

Reg.Z 

(QZs) QZo-a 

Output 

(As) Ao-8 

Bo-a 

Bo-a -> QYo-a 
(LE-1) 

Result of even 
parity check 

Even parity generated 
bit 

QYo- 8 -> QZo-a 
(CP = Lo to Hi) 
(BEN = 0) 

As = Even parity 
generated from QYo-7 

QZo-a -» Ao-8 
(BOE - 0) 


2594 tbl 05 


6.6 


4 






























































IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 
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2594 drw 04 

Figure 1. R3000 System with No Parity Support in Main Memory 
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FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 
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Figure 2. R3000 System with Parity Support in Main Memory 
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IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


ABSOLUTE MAXIMUM RATINGS (1) 


Symbol 

Rating 

Com’l. 

Mil. 


Vterm 

Terminal Voltage 
with Respect 
to Ground 

-0.5 to 
VCC + 0.5 

-0.5 to 
Vcc + 0.5 


Vcc 

Power Supply 
Voltage 

-0.5 to +7.0 

-0.5 to +7.0 


Ta 

Operating 

Temperature 

0 to +70 

-55 to +125 


Tbias 

Temperature 
Under Bias 

-55 to +125 

-65 to +135 

°C 

Tstg 

Storage 

Temperature 

-55 to +125 

-65 to +150 

°C 

Pt 

Power Dissipation 

1.2 

1.5 

W 

lOUT 

Total Output 
Current 

200 

250 

mA 


NOTE: 2594 tbl 06 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM 
RATINGS may cause permanent damage to the device. This is a stress 
rating only and functional operation of the device at these or any other 
conditions above those indicated in the operational sections of this 
specification is not implied. Exposure to absolute maximum rating 
conditions for extended periods may affect reliability. 


CAPACITANCE (Ta = +25°C, f = 1.0MHz) 


Symbol 

Parameter* 1 * 

Conditions 

Typ. 

Unit 

ClN 

Input 

Capacitance 

VlN = ov 

5 

pF 

COUT 

Output 

Capacitance 

Vout = 0V 

7 

PF 

Ci/O 

Input - Output 
Capacitance 

Vout = 0V 

7 

PF 


NOTE: 2594 tbl 07 

1 . This parameter is sampled and not 100% tested. 


DC ELECTRICAL CHARACTERISTICS OVER OPERATING RANGE 

The following conditions apply unless otherwise specified: 

Commercial: Ta = 0°C to +70°C, Vcc = 5.0V ±5%; Military: Ta = -55°C to +125°C, Vcc = 5.0V ±10% 


Symbol 

Parameter 

Test Conditions* 1 * 

Min. 

Typ.* 2 * 

Max. 

Unit 

VlH 

Input HIGH Level 

Guaranteed Logic HIGH Level 

2.0 

— 

— 

V 

VlL 

Input LOW Level 

Guaranteed Logic LOW Level 

— 

_ 

0.8 

V 

IlH 

Input HIGH Current 

Vcc = Max. 

Vi = 2.7V 

Except I/O 

— 

— 

10 

pA 

I/O pins 

— 

— 

20 

IlL 

Input LOW Current 

Vcc = Max. 

Vl = 0.5V 

Except I/O 

— 

— 

-10 

pA 

I/O pins 

— 

— 

-20 

VlK 

Clamp Diode Voltage 

Vcc = Min., In = -18mA 

— 

-0.7 

-1.2 

V 

los 

Short Circuit Current 

Vcc = Max.* 3 *, Vo = GND 

PERRA, PERRB 

-30 

— 

-150 

mA 

AO-8, BO-8 

-20 

— 

-75 

VOH 

Output HIGH Voltage 

Vcc = Min. 

Vin = Vih or Vil 

Ioh = -12mA MIL. 

Ioh = -15mA COM’L. 

H 

3.3 

■ 

V 

VOL 

Output LOW Voltage 

Vcc = Min. 

Vin = Vih or Vil 

Ao-s 

Bo-8 

Iol = 48mA MIL. 

Iol = 64mA COM'L. 

1 

0.3 

0.5 

V 

Vcc = Min. 

Vin = Vih or Vil 

PERRA 

PERRB 

Iol = 20mA MIL. 

Iol = 24mA COM'L. 

Vh 

Input Hysteresis for CP only 

Vcc = 5V 

— 

200 

— 

mV 


NOTES: 2594 tbl 09 

1 . For conditions shown as Min. or Max., use appropriate value specified under Electrical Characteristics for the applicable device type. 

2. Typical values are at Vcc = 5.0V, +25°C ambient, not production tested. 

3. Not more than one output should be shorted at one time. Duration of the short circuit test should not exceed 100 millisecond. 
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IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


POWER SUPPLY CHARACTERISTICS 

Commercial: Ta = 0°C to +70°C, Vcc = 5.0V ±5%; Military: Ta = -55°C to +125°C, Vcc = 5.0V ±10% 
VLC = 0.2 V; VHC = VCC - 0.2V 


Symbol 

Parameter 

Test Conditions' 1 ) 

Min. 

Typ.' 2 ) 

Max. 

Unit 

Iccoc 

Quiescent Power Supply Current 

Vcc = Max., Vin = GND or Vcc 

— 

0.001 


mA 

Iccqt 

Quiescent Power Supply Current 

Vcc = Max. 

COM’L 

— 

3 

10 

mA 


TTL Inputs HIGH 

Vin = 3.4 (3) 

MIL. 

— 

3 

15 


ICCDl 

Dynamic Power Supply 

Current' 4 ) 

Vcc = Max. 

Outputs Disabled 
fcp = 10MHz 

50% Duty Cycle 
fi - 5MHz 

Vin > Vhc 

Vin < Vlc 

1 

6.0 

15 

mA 

ICCD2 

Dynamic Power Supply 

Current' 4 ) 

Vcc = Max. 

Outputs Disabled 
fCP = 40MHz 

50% Duty Cycle 
fi = 20MHz 

Vin > Vhc 

Vin £ Vlc 

1 

24 

60 

mA 


NOTES: 2594 tbi oa 

1 . For conditions shown as Min. or Max., use appropriate value specified under Electrical Characteristics for the applicable device type. 

2. Typical values are at Vcc = 5.0V, +25°C ambient and maximum loading, not production tested. 

3. This parameter is not directly testable but is derived for use in the total power supply calculation. 

4. Ic = Iquiescent + IlNPUTS + Idynamic 
Ic = Iccoc + ICCQT DhNt + ICCD 
Iccqc = Quiescent Current 

Iccqt = Power Supply Current for a TTL High Input (Vin = 3.4V) 

Dh = Duty Cycle for TTL Inputs High 
Nt = Number of TTL Inputs at DH 

I CCD = Dynamic Current caused by an Input Transition Pair (HLH or LHL) 

All currents are in milliamps and all frequencies are in megahertz. 
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IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


SWITCHING CHARACTERISTICS OVER COMMERCIAL OPERATING RANGE 

TA = 0°C to +70°C; Vcc = 5V ± 5% 

Cl = 50pF; RL = 500fl 


Parameter 

Description 

Min. 

Typ- (1) 

Max. 

Unit 

tPLH 

tPHL 

Propagation Delay 

Clock to Ao-8 (AOE = Low) 

Clock to Bo- 8 (BOE » Low) 



10.0 

ns 

tPHL 

Propagation Delay 

CP to PERRA, PERRB 

— 

— 

8.5 

ns 

tPHL 

Propagation Delay 

POLARITY to Bo-8 

— 

— 

7.0 

ns 

tPHL 

Propagation Delay 

Bo- 8 to PERRB 

LE = High 



8.5 

ns 

ts 

Set-up Time 

Ao-8, Bo- 8, POLARITY, SEL to CP 

2.0 f! 

m*. '%r 

— 

ns 

tH 

Hold Time 

Ao-8, Bo- 8, POLARITY, SEL to CP 

.. 1.5 

— 

— 

ns 

ts 

Set-up Time 

AEN, BEN to CP Low-to-High 

2.0 

— 

— 

ns 

tH 

Hold Time 

AEN, BEN to CP Low-to-High ?• 

1.5 

— 

— 

ns 

ts 

Set-up Time 

Bo-8 to LE 

2.0 

— 

— 

ns 

tH 

Hold Time 

Bo-8 to LE 

1.5 

— 

— 

ns 

ts 

Set-up Time 1|| : . 

Bo- 8 to CP to Low-to-High; LE = High 

3.0 

— 

— 

ns 

tH 

Hold Time 

Bo- 8 to CP to Low-to-High; LE = High 

1.5 

— 

— 

ns 

tPZH 

tPZL 

Output Enable Time 

AOE to Ao- 8, BOE to Bo-8 

— 

— 

7.0 

ns 

tPHZ 

tPLZ 

Output Disable Time 

AOE to Ao-8, BOE to Bo-8 

— 

— 

6.5 

ns 

tPWH 

Clock Pulse Width High 

7.0 

5.0 

— 

ns 

tPWL 

Clock Pulse Width Low 

7.0 

5.0 

— 

ns 


NOTE: 2594 4)110 

1 . Typical values are at Vcc = 5.0V and +25°C ambient, not production tested. 
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IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


SWITCHING CHARACTERISTICS OVER MILITARY OPERATING RANGE 

Ta = -55°C to +1 25°C; VCC = 5V ± 1 0% 

CL = 50pF; RL = 500fl 


Parameter 

Description 

Min. 

Typ. (1 > 

Max. 

Unit 

tPLH 

tPHL 

Propagation Delay 

Clock to Ao- 8 (AOE = Low) 

Clock to Bo- 8 (BOE « Low) 



12.2 

ns 

tPHL 

Propagation Delay 

CP to PERRA, PERRB 

— 

— 

10.6 

ns 

tPHL 

Propagation Delay 

POLARITY to Bo-8 

— 

— 

7.0 

ns 

tPHL 

Propagation Delay 

Bo- 8 to PERRB 

LE = High 



10.6 

ns 

ts 

Set-up Time 

Ao-a, Bo- 8, POLARITY, SELto CP 

2.0 

. — 

— 

ns 

tH 

Hold Time 

Ao-a, Bo- 8, POLARITY, SEL to CP 

1.5 

— 

— 

ns 

ts 

Set-up Time 

AEN, BEN to CP Low-to-High 

IPfl, 2.0 

— 

— 

ns 

tH 

Hold Time % 

AEN, BEN to CP Low-to-High ? 

1.5 

— 

— 

ns 

ts 

Set-up Time 

Bo- 8 to LE 

2.0 

— 

— 

ns 

tH 

Hold Time 

Bo- 8 to LE 

1.5 

— 

— 

ns 

ts 

Set-up Time 

Bo-a to CP to Low-to-High; LE = High 

3.0 

— 

— 

ns 

tH 

Hold Time 

Bo-8 to CP to Low-to-High; LE = High 

1.5 

— 

— 

ns 


Output Enable Time 

AOE to Ao- 8, BOE to Bo-e 

— 

— 

7.0 

ns 

tPHZ 

tPLZ 

Output Disable Time 

AOE to Ao- 8, BOE to Bo-a 

— 

— 

6.5 

ns 

tPWH 

Clock Pulse Width High 

8 

6 

— 

ns 

tPWL 

Clock Pulse Width Low 

8 

6 

— 

ns 


NOTE: 2594 tbl 1 1 

1 . Typical values are at Vcc = 5.0V and +25°C ambient. 
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IDT73210, IDT7321 1 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 



Vcc 



O OUTPUTS 


Figure 5. Output Interface Circuit 


Vcc + 7.0V 



DEFINITIONS: 

Cl = Load capacitance: includes jig and probe capacitance 

Rl = Termination resistance: should be equal to Zout of the Pulse Generator 

Figure 6. AC Test Load Circuit 


AC TEST CONDITIONS 


Input Pulse Levels 

GND to 3.0V 

Input Rise/Fall Times 

1 V/ns 

Input Timing Reference Levels 

1.5V 

Output Reference Levels 

1.5V 

Output Load 

See Figure 6 


2594 tbt 12 


Test 

Switch 

Open Drain 

Closed 

Disable Low 


Enable Low 


All other Tests 

Open 
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IDT73210, IDT73211 

FAST CMOS OCTAL REGISTER TRANSCEIVER WITH PARITY 


MILITARY AND COMMERCIAL TEMPERATURE RANGES 


ORDERING INFORMATION 

IDT XXXX X X 

Device Type Package Process/ 


BLANK Commercial (0°C to +70°C) 

B Military (-55°C to +1 25°C) 

Compliant to MIL-STD-883, Class B 

Y 32-pin Small Outline 1C (J-Bend) 

TP 32-pin Thin Plastic Dip (300mil wide) 

7321 0 8-bit One Single, One Double Pipeline Registers 

7321 1 8-bit Two Single Pipeline Registers 

2594 drw 10 
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PACKAGE DIAGRAM OUTLINES 





RISC PROCESSING COMPONENTS 



RISC SUPPORT COMPONENTS 



RISC MODULE PRODUCTS 


RISC DEVELOPMENT SUPPORT 




APPLICATION MOTES 



RISC MODULE PRODUCTS 


Maximizing the performance of R3000 systems means 
designing with very high-speed components and finely tuning 
PC board layouts to work at very high clock rates. IDT offers 
a variety of pre-built, pre-tested RISC subsystems that can be 
used to eliminate this part of the design task. 

Roughly three by six inches in size, the modules are built on 
8-10 layer PC boards with components surface mounted on 
both sides. Most modules include at least the CPU, optional 
FPA, cache RAMs and Read/Write Buffers. The high-speed 
clock is also on the module, along with some reset, interrupt 
and control logic. The net effect is to put all of the very high- 
speed logic onto a tightly integrated, independent subsystem 


that can be purchased like a component. The modules are 
100% burned-in and tested at the rated speed. 

The modules are designed around several different archi- 
tectures. Within a given architecture, there is a range of 
speeds and cache sizes to choose from. All the modules with 
the same architecture are plug compatible, so price/perform- 
ance options are easy to offer in the end system by simply 
selecting the appropriate RISC Subsystem module. 

Prototyping development systems are available for each 
module architecture to serve as a starting point for additional 
hardware and software development. 
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Integrated Device Technology, Inc. 


R3000 CPU MODULES 
FOR GENERAL APPLICATIONS 


IDT7RS1 01 


FEATURES: 

• R3000 CPU on 3.7" x 6.5" plug-ln module 

• 64K each Instruction and Data Caches 

• On-board clock generation 

• Four-word read buffer for block refill. Single 
word write buffer 

• On-board parity generation 

• Five user Interrupts into on-board register 

• Available with or without Floating Point 
Accelerator 

< Cache supports full 32-bit address space 

• 100% burn-in and functional test at rated speed 


POWERFUL GENERAL PURPOSE R3000 
MODULE: 

The IDT7RS101 is a complete reduced instruction set 
computer (RISC) CPU, based on the MIPS R3000 RISC 
processor, and supplied on a small fully-tested high-density 
plug-in module. The module includes the R3000 CPU, 64 
Kbytes each of data and instruction cache memory, a single 
word write buffer and a four-word read buffer to support 
block refill. Clock generation, reset, control and interrupt 
functions are included on the module to simplify the remain- 
der of the system design. Parity bits on incoming data words 
may be generated automatically on the module, transparent 
to the rest of the system. Alternatively, parity may be 
handled in the user system, with on-board circuits only 
performing optional parity check functions. Five user inter- , 
rupts are provided with an on-board clocked register to 1 
ensure synchronized activity with the R3000 timings. 

The module is constructed using surface mount devices 
on a 3.72 by 6.5” epoxy laminate board, and is connected to 
the user’s system via four 50-pin Insulation Displacement 
Connectors. 



7RS101 Module. Actual Size 3.72" x 6.50 " 


RSD lOIpb/A/R 


OCTOBER 1990 


©1 990 Integrals) Device Technology, Inc. 
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IDT7RS101 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 
Four-word Read Buffer 

The 7RS101 includes a single write buffer and a four- 
word deep read buffer. Cache read miss operations (memory 
data not currently stored in the cache) may be handled either 
with a four-word block refill or with a single-word fetch and 
cache update. All control signals are available to implement 
either option. Address mapping can be used to force block 
refills on some addresses (for example, instructions) and 
single-word updates on other addresses (forexample.data). 

Clock Generation 

The clocks for both the R3000 and the R3010 are auto- 
matically generated on the module using a very accurate 
and stable delay line driven by a single user-supplied input 
clock signal. There are three buffered clock output signals 
for use with external control logic and system timing, each 
of which is an identical inverted version of the R3000 output, 
SYSOUT#. 

Parity Generation 

The R3000 Processor requires incoming data words to 
consist of 32 bits of data and 4 bits of parity. The 7RS101 
module can be set to either of two modes for parity handling. 
It can check parity and report errors on incoming data that 
consists of 32 data bits and 4 parity bits. In the other mode, 
it can generate parity on 32 bits of incoming data and supply 
the full 36 bits to the CPU. 


Initialization Options 

The R3000 requires mode selection to be made during 
the RESET initialization sequence. The 7RS101 module 
provides three pins that can simply be tied High or Low by 
the user to select some of the R3000’s options: Instruction 
Streaming on or off, Partial Word Store on or off, and 
BigEndian or LittleEndian byte order. 

User Interrupts 

Six user interrupt inputs are provided. Each of these is a 
negative-true signal, terminated with a 4.7K ohm pull-up 
load resistoron the module, so pins may be left unconnected 
if they are not used. The interrupt signals are clocked into 
an Interrupt Input Register on the module by the CPU clock 
SYSOUT. This ensures that the interrupt inputs to the 
R3000 are synchronized to its clock. One of the interrupts 
is a reserved for use by the R3010 FPA; the other 5 are 
available for the user. 

Buffered Outputs 

The address and data lines coming out of the module are 
buffered, and can support substantial bus drive. All control 
signals except those coming directly from the R3000 or 
R3010 are also buffered. 
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IDT7RS101 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal 

Type 

Functional Description 

MD00-MD31 

I/O 

Memory data lines to/from main memory system. 

PAR0-PAR3 

I/O 

Parity bits for data lines. Unconnected if on-board parity generation selected. 

MA00-MA31 

OUT 

Memory address lines to main memory system. These are registered outputs. 

CA2-CA3 

OUT 

Block refill counter outputs. These lines are normally used instead of MA02-MA03, since they are the 
outputs of the counter used to implement the 4-word block refill function. 

RACT(0:1) 

OUT 

Positive-true outputs indicating the states of the R3000 outputs, ACCTYP(0:1 ), which identify the 
size of data transactions for read/write cycles. These are registered outputs, like MA00-MA31 . 

BACT2 

OUT 

Buffered ACCTYP(2) output from R3000, used to distinguish between cache and non-cache memory 
operations. 

AOE# 

IN 

Negative-true input to enable the 3-state register output pins, MAOO-MA31 and RACT(0:1). 

UINT0-UINT5 

IN 

User interrupt inputs. Each has a 4.7Kohm pullup load resistor. UINT1 is reserved for R3010 FPA 
usage. 

OSCIN 

IN 

Oscillator input clock signal. (2x clock rate). 

MRES# 

IN 

Negative-true master reset input. 

RESSW1-3 

IN 

Mode selection inputs used to determine R3000 setup options during reset initialization sequence. 
Each has a 4.7K ohm pullup load resistor. Jumpers or switches to ground select the desired options. 

SYSOUT1 -3 

OUT 

Buffered clock outputs for synchronizing external events. Each of these is an identical clock signal, 
representing the inverted form of the R3000 output, SYSOUT#. 

MEMRD# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory read cycle is in progress. 

MEMWR# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory write cycle is in progress. 

RBSY 

IN 

Positive-true input used to request a memory read stall initiation and termination. This signal is 
normally held in its asserted state and de-asserted at the completion of CPU stalls. 

WBSY# 

IN 

Negative-true input used to request a busy indication for subsequent memory write operations. 

BLKR# 

IN 

Negative-true input used to request a block read sequence for read operations from main memory. 

AEN# 

IN 

Negative-true input used to enable the clock for loading the address register. 

CEN 

IN 

Positive-true input used to enable the increment of the block refill address counter for pins CA2-CA3. 

PHOLD 

IN 

Positive-true input used to inhibit clocking of the read buffer. This signal is normally the complement 
of the CEN input. 

WOE# 

IN 

Negative-true input used to enable the data output drivers for main memory write cycles. 

WCTL# 

IN 

Negative-true input used to enable the clock to load data into the write data register for main memory, 
write cycles. 

CPCO 

IN 

Direct input to R3000 Processor, used to indicate the size of the data (block, word, byte, or other) for 
memory read cycles. 

CPC1 

OUT 

Connection between the R3000 Processor and the R301 0 FPA. Indicates the status of the conditional 
branch. This pin is provided for diagnostic purposes, only. 

EXC# 

OUT 

Direct output from R3000, indicating the EXC# signal between the R3000 and the R3010. 

RUN# 

OUT 

Negative-true output from the R3000, indicating that the R3000 is in its RUN state (not stalled). 

BERR# 

IN 

Negative-true input to R3000 (with 4.7K ohm pullup), indicating a bus error in main memory. 

FPA# 

OUT 

Negative-true output from R3010, indicating the presence of R3010 FPA on the module. 

POE# 

IN 

Negative-true input with 4.7K ohm pullup resistor, used to enable the on-board parity generation logic. 

It is left unconnected if parity is to be handled by user system. 

PERR# 

OUT 

Negative-true output which indicates a parity error on incoming data when on-board parity generation 
is not selected. 

TAGV 

OUT 

Tag validity indicator, connected between R3000 and cache memory. Provided for diagnostic 
purposes, only. 




























































































IDT7RS101 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


RELATED PRODUCTS 
Prototyping System 

The 7RS101 module can be placed into immediate ser- 
vice using our flexible 7RS301 Prototyping Platform. The 
system includes two boards: a general purpose CPU board, 
and a personality card that interfaces the module to the C PU 
board. 

The CPU board contains 1 Mb of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, 
and an 8-bit parallel port accessible through a dual port 
RAM. Four 50-pin connectors provide access to all the 
address, data, and control signals for external connection to 
additional hardware on, for example, a wire-wrap board. 


The system includes IDT’s Software Integration Man- 
ager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and con- 
nectors for an HP 16500 Logic Analyzer. 

Code for the R3000 can be created on a MIPS develop- 
ment system, on IDT's MacStation™ system, or using IDT’s 
PC-based cross assembler and compiler products. As- 
sembled code can be downloaded into the Prototyping 
System for execution and debug. 



ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

l-cache 

D-cache 

Speed 

Other 

7RS101F66A16A 

R3000A 

R3010A 

64K 

64K 

16 MHz 


7RS101F66A20A 

R3000A 

R3010A 

64K 

64K 

20 MHz 


7RS101F66A25A 

R3000A 

R3010A 

64K 

64K 

25 MHz 


7RS101F66A30A 

R3000A 

R3010A 

64K 

64K 

30 MHz 


7RS101N66A16A 

R3000A 

None 

64K 

64K 

16 MHz 


7RS101N66A20A 

R3000A 

None 

64K 

64K 

20 MHz 


7RS101N66A25A 

R3000A 

None 

64K 

64K 

25 MHz 


7RS101N66A30A 

R3000A 

None 

64K 

64K 

30 MHz 




ADDITIONAL INFORMATION 

For detailed technical specifications on this module refer to the 7RS101 Product Specification and User’s Manual. 

CUSTOM OPTIONS 

Some features of the 7RS101 can be modified on special order. Contact your IDT sales office for information. 
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FEATURES: 

• Cache Size: 16K Instruction, 16K Data 

• Extremely small size: 3.2" x 3.9" 

• Processor Speeds up to 25 MHz 

• Includes R3010 Floating Point Accelerator 

• Single word Read and Write Buffers 

• 100% burn-in and functional test at rated speed 


R3000 MODULE FOR GENERAL USE IN 
SMALL SYSTEMS: 

The IDT7RS102 is a complete reduced instruction set 
computer (RISC) CPU, based on the MIPS R3000 RISC 
processor, and supplied on a small fully-tested high-density 
plug-in module. The module includes the R3000 CPU, the 
R3010 Floating Point Accelerator, 16 Kbytes each of data 
and instruction cache memory, a single word read buffer and 
a single word write buffer. 

Cache misses are handled with single word read re- 
quests to memory, providing a simple interface to any type 
of main memory system. 

The module is constructed using surface mount devices 
on a 3.2" by 3.9" epoxy laminate board, and is connected to 
the user’s system via 1 44 pins located in two pin row regions 
on the board. 



7RS1 02 Module. Actual Size 3.2" x 3.9 


DECEMBER 1990 


RSD PB102/A/R 


©1990 Integrated Device Technology, Inc. 
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IDT7RS102 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 
Small and Simple 

The 7RS102 module is designed to be as small as 
possible and to provide a simple interface to the user's 
system. The 16K caches are the smallest useful for most 
systems. 

R3010 Floating Point Accelerator 

The R3010 Floating Point Accelerator (FPA) is included 
as an integral part of the module. It operates in conjunction 
with the R3000 RISC Processor and greatly improves the 
system performance by expanding the instruction set to 
include very fast floating point capabilities. All timing and 
control connections are on the module and are completely 
transparent to the user. 

Clock Generation 

The clock inputs to the 7RS1 02 are the direct connections 
to the clocks for both the R3000 and the R3010. These 
clocks must be generated in the user system and applied to 
the module. 

Cache Memory 

Cache memory is provided on the module for a capacity 
of 16K bytes for each of the two R3000 cache memory 
systems (Instruction Cache and Data Cache). Memory 
operations which require main memory data transfers are 
conveniently handled by means of a variety of on-board 
control signals. 

Cache read miss operations are handled as single-word 
fetch and cache update. Non-cache read operations (such 
as I/O reads) are indicated by means of control signals and 
are easily accomodated by the user. 


Parity Generation 

The R3000 Processor requires incoming data words to 
consist of 32 bits of data and a 4-bit parity code. Each of the 
4 parity bits applies to a particular byte in the word. The 
required parity is even. The user system is required to 
generate parity for incoming data to the module and may 
optionally check parity fordata being passed to main memory. 

Address and Data Buffers 

The address and data lines coming out of the module are 
buffered and can support substantial drive requirements. 
The address pins are direct outputs from registers and 
include the signals MACT0-MACT2. The three-state output 
drivers may be disabled by de-asserting the output enable 
control line, AOE. This is not normally done, but is provided 
as a feature for systems which may require it. 

The data pins are driven by Bi-directional Registers. 
Enable/disable control of the three-state output drivers is 
accomplished with the signal, DOE. Memory write cycles 
utilize a single-word write buffer on the module which 
permits the R3000 Processorto continue running while data 
is being written into main memory. USER INTERRUPT 
INPUTS 

R3000 User Interrupts 

Six user interrupt inputs are provided, INT0-INT5. Each 
of these is a negative-true signal, terminated with a 1 0K ohm 
pullup load resistor on the module. In this way, the pin may 
be left unconnected if it is not to be used. The interrupt 
signals are connected directly to the interrupt pins of the 
R3000 Processor. INTI is a reserved pin on this version of 
the module and is required for use by the R301 0 FPA. As a 
result, it may not be used and must be left unconnected. 
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IDT7RS102 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


FUNCTIONAL BLOCK DIAGRAM 


CPU Controls 
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IDT7RS102 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal Name 

Type 

Description 

MDOO-MD31 

I/O 

Memory data lines to/from main memory system. 

MPAR0-MPAR3 

I/O 

Parity bits for data lines. Parity must be supplied to the 7RS1 02 module, in accordance with R3000 
requirements. 

MA00-MA31 

OUT 

Memory address lines to the main memory system. These are registered outputs. 

MACT0-MACT2 

OUT 

Positive-true outputs indicating the states of the R3000 outputs, ACCTYP(0:2), which identify the 
nature of the data transactions for read/write cycles. These are registered outputs, like MA00-MA31 . 

MTAGV 

OUT 

Registered TAGV output from R3000. 

ACCTYP2 

OUT 

Unbuffered ACCTYP(2) output from R3000, used to distinguish between cache and non-cache 
memory operations. 

AOE# 

IN 

Negative-true input to enable the three-state register output pins, MAOO-MA31 , MACT0-MACT2, and 
MTAGV. 

INT0-INT5 

IN 

Interrupt inputs. Each has a 10K ohm pullup loadresistor. INTI is reserved for R3010 FPA usage. 

CLK2XPHI 

IN 

Clock input for R3000 and R3010. Timings must conform to R3000 specifications. 

CLK2XRD 

IN 

Clock input for R3000 and R3010. Timings must conform to R3000 specifications. 

CLK2XSYS 

IN 

Clock input for R3000 and R3010. Timings must conform to R3000 specifications. 

CLK2XSMP 

IN 

Clock input for R3000 and R3010. Timings must conform to R3000 specifications. 

MRES# 

IN 

Negative-true master reset input. Connects directly to R3000 RES# input pin. 

SYSOUT1-3 

IN 

Buffered clock outputs for synchronizing external events. Each of these is an identical clock signal, 
representing the inverted form of the R3000 output, SYSOUT#. 

MEMRD# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory read cycle is in progress. 

MEMWR# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory write cycle is in progress. 

RBSY 

IN 

Positive-true input used to request a memory read stall initiation and termination. This signal is 
normally held in its asserted state and deasserted at the completion of stalls. 

WBSY# 

IN 

Negative-true input used to request a busy indication for subsequent memory write operations. 

ACE# 

IN 

Negative-true input used toenable the clock for loading the address register. 

RDEN# 

IN 

Negative-true input used toenable the clock for loading the data register for memory read cycles.. 

DOE# 

IN 

Negative-true input used to enable the three-state data outputs, MD00-MD31 and MPAR0-MPAR3. 

WCTL# 

IN 

Negative-true input used to enable the clock to load data into the write data register for main memory 
write cycles. 

CPCO 

IN 

Direct input to the R3000, used to indicate tha size of the data (block, word, byte, or other) for memory 
read cycles. 

CPC1 

OUT 

Connection between the R3000 Processor and the R301 0 FPA, indicating the status of the conditional 
branch. This pin is provided for diagnostic purposes, only. 

CPC2, CPC3 

I/O 

Direct connections to R3000 pins. 

EXC# 

OUT 

Direct output from R3000, indicating the EXC# signal between the R3000 and the R3010. 

RUN# 

OUT 

Negative-true output from the R3000, indicating that the R3000 is in its RUN state (not stalled). 

BERR# 

IN 

Negative-true input to R3000, used to indicate a bus error in main memory. 

FPA# 

OUT 

Negative-true output fro R3010, indicating the presence of R3010 FPA on the module. 
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IDT7RS102 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


RELATED PRODUCTS 
Prototyping System 

The 7RS102 module can be placed into immediate ser- 
vice using our flexible 7RS302 Prototyping Platform. The 
system includes two boards: a general purpose CPU board, 
and a personality card that interfaces the module to the CPU 
board. 

The CPU board contains 1Mb of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, 
and an 8-bit parallel port accessible through a dual port 
RAM. Four 50-pin connectors provide access to all the 
address, data, and control signals for external connection to 
additional hardware on, for example, a wire-wrap board. 


The system includes IDT’s IDT/sim System Integration 
Manager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and con- 
nectors for an HP 16500 Logic Analyzer. 

Code for the R3000 can be created on a MIPS develop- 
ment system, on IDT's MacStation™ system, or using IDT’s 
PC-based cross assembler and compiler products. As- 
sembled code can be downloaded into the Prototyping 
System for execution and debug. 



Module Prototyping Platform. 

The card on the left is the personality card with a module; the card on the right Is the general purpose CPU. 

ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

l-cache 

D-cache 

Speed 

Other 

7RS102-16A 

R3000 

R3010 

16K 

16K 

16 MHz 


7RS102-20A 

R3000 

R3010 

16K 

16K 

20 MHz 


7RS102-25A 

R3000 

R3010 

16K 

16K 

25 MHz 


7RS102F16A 

R3000 

R3010 

16K 

16K 

16 MHz 


7RS102F20A 

R3000 

R3010 

16K 

16K 

20 MHz 


7RS102F25A 

R3000 

R3010 

16K 

16K 

25 MHz 



NOTE: 


1. 7RS1 10 module recommended for new designs. 

ADDITIONAL INFORMATION 

For more details on the 7RS102 module, refer to the 7RS1 02 Technical Specification and User's Manual. 
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IDT7RS103 



Integrated Device Technology, Inc. 


R3000 CPU MODULES 
FOR COMPACT SYSTEMS 


DISTINCTIVE FEATURES: 

• Three Cache Size Versions: 

16K Instruction, 16K Data (7RS1 03-44) 

32K Instruction, 32K Data (7RS1 03-55) 

64K Instruction, 64K Data (7RS1 03-66) 

• Extremely small size: 2.9" x 3.7" 

• Processor speeds up to 25 MHz 

• Optional R3010 Floating Point Accelerator 

• On-board delay line to create R3000 clocks 

• 100% burn-in and functional test at rated speed 


R3000 MODULE FOR COMPACT HIGH 
PERFORMANCE SYSTEMS: 

The IDT7RS103 is a family of interchangeable RISC CPU 
Subsystem modules, based on the MIPS R3000 RISC pro- 
cessor, and supplied on small fully-tested high-density plug-in 
modules. The module includes the R3000 CPU, optionally the 
R3010 Floating Point Accelerator, and data and instruction 
cache memory. Versions are available with 1 6K each I and D 
cache (7RS103-44), 32K each I and D cache (7RS103-55) 
and 64K each I and D cache (7RS1 03-66) . The three versions 
differ only in the length of the board. All plug into the same 
socket. The delay line to generate the three R3000 2x clock 
signals is included on the module, so the module can be driven 
from a single 2x clock. 

Externally, the user system supplies the R3000 control 
signals and the read and write buffers. , 

The module is constructed using surface mount devices on i 
both sides of a 2.9" epoxy laminate board, and is connected 
to the user’s system via 1 92 pins located in two pin row regions 
on the board. 



7RS103-44 Module. Actual Size 2.9" x 3.7" 


RSD P9103/B/H 


DECEMBER 1990 


©1990 Integrated Device Technology, Inc. 
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IDT7RS103 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS: 

The Minimal Module 

The 7RS103 is designed to provide an R3000 RISC 
Subsystem in as small a space as possible. It includes only 
the CPU (and FPA), cache memories, and a delay line to 
generate the 2x clocks to the R3000. The read and write 
buffers and control logic are handled off the module by the 
user's system. This makes the module ideal for use with 
ASICs or other unique implementations of main memory 
interface. 


The R3000 timing and control signals are brought directly 
off the module. The R3000 data sheet should be consulted for 
all the timing specifications. One of the interrupt inputs is 
required by the R3010 on versions that include the FPA 
device. 

The three versions of the 7RS103 differ only in the cache 
memory sizes. They are completely interchangeable. 


FUNCTIONAL BLOCK DIAGRAM 


ACCTYP + 


CTRL ADDRLO TAG DATA 



EXTERNAL 

OSCILLATOR 
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IDT7RS104 

Integrated Device Technology, Inc. 

1 



FEATURES: 

• 128K each of Instruction and Data RAM in 
Synchronous Memory 

• Simple Interface to peripherals 

• On-Board Dual UART 

• On-Board DMA Control 

• Includes IDT/sim Monitor in EPROM 

• On-Board 8254 Counter/Timer 

• On-board oscillator, delay line, and reset 
circuitry 

• 100% burn-in and functional test at rated speed 


R3001 MODULE WITH SYNCHRONOUS 
MEMORY 

The IDT7RS1 04 is a complete reduced instruction set com- 
puter (RISC) CPU, based on the IDT 79R3001, an IDT 
derivative of the MIPS R3000 RISC processor, and supplied 
on a small fully-tested high-density plug-in module. The 
module includes the CPU, optionally the R3010 Floating 
Point Accelerator, and 256K of synchronous memory di- 
vided into 128K each for Instruction and data space. Clock 
generation, reset, control and interrupt functions are in- 
cluded on the module to simplify the remainderof the system 
design. 

The 104 module takes advantage of the R3001’s ability to 
address large amounts of synchronous memory, permitting 
the entire application program to reside in high-speed memory 
space. 

The module is constructed using surface mount devices on 
a 3.7" by 6.1" epoxy laminate board, and is connected to the 
user’s system via pins located in two pin row regions on the 
board. 
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7RS104 Module. Actual Size: 3.7" x 6.1" 


DECEMBER 1990 


RSD PB104/A/R 


©1990 Integrated Device Technology, Inc. 
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IDT7RS104 

R3001 RISController™ ENGINE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 
Complete RISC Computer 

The IDT7RS1 04 is a complete reduced instruction set com- 
puter (RISC) module optimized for embedded control appli- 
cations. It is based on the R3001 RISController and in- 
cludes the R3010 Floating Point Accelerator (FPA) to en- 
hance performance. On-board EPROM and serial I/O allow 
the module to perform as a stand-alone computer. All logic 
and control functions have been incorporated in the module 
in order to simplify system design for the end user. An 
82C54 Programmable Interval Timer is included for periodic 
interrupt functions. 

R3001 CPU 

The module uses IDT's R3001 CPU device. The R3001 is 
architecturally the same as the R3000, but permits disabling 
of parity checking, thereby eliminating several bits of width 
in the synchronous memories. It is also capable of support- 
ing synchronous memory spaces larger than the 256 KB 
maximum supported by the R3000. Like the R3000, the 
R3001 uses the R3010 Floating Point Accelerator for high 
speed arithmetic. 

Simple Interface 

For ease of use, all complex clock timing signals and 
handshaking logic are generated on the module and are 
derived from either an on-board oscillator, or an externally- 
supplied clock input signal. This frees the system designer 
from the task of having to design a state machine which 
implements the system’s handshake and arbitration func- 
tions. 

Large Synchronous Memory Space 

A large "cache" memory is included on the module with 1 28K 
byte capacity for the Instruction Cache and 128K byte 
capacity for the Data Cache. This large cache allows most 
embedded control software to run completely within the 
cache, making external asynchronous memory unneces- 
sary. 

In fact, the 7RS1 04 "cache" memories are intended to be the 
machine's main memory. They are initially filled with in- 
structions by using code in the EPROMs to move data from 
module's bus into memory using Load and Store instruc- 
tions. 

Reset and Initialization Logic 

A master reset input triggers special initialization logic which 
is used to set the modes of operation of the R3001 with no 
user intervention required. The module is reset to Bigendian. 


Two Synchronized User Interrupts 

There are two pins for user interrupts provided on the 
module. They feed an on-board locked register to ensure 
synchronized activity with the R3000. The remaining four 
R3000 interrupts are used on the board: two interrupts are 
dedicated to the 82C54 Timer, one interrupt is dedicated to 
the 2681 DUART, and another interrupt is connected to the 
R3010 Floating Point Accelerator. 

TYPICAL APPLICATIONS 

The 7RS104 Module is designed to be a complete R3000 
based controller, with a simple asynchronous handshaking 
to external I/O devices or possibly additional memory. The 
two 1 28 KB synchronous memory blocks on the module are 
intended to be large enough to store all the instructions and 
data used by the machine, so there are no cache miss stalls. 
This design not only allows the system to run at maximum 
performance, but also eliminates the inconsistencies in 
execution speed that result from cache misses and refills in 
conventional R3000 designs. 
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IDT7RS103 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Pin Name 

Type 

Functional Description 

D00-D31 

I/O 

Memory data lines to/from main memory system. 

DP0-DP3 

I/O 

Parity bits for data lines. Parity must be supplied to the 7RS103 module, in accordance with R3000 
requirements. 

A00-A15 

OUT 

Address lines from the R3000. (lower 16 bits) 

T16-T31 

OUT 

Tag lines from the R3000. (higher 1 6 bits) 

ACCTYPO- 

OUT 

Positive-true outputs indicating the states of the R3000 ACCTYP2outputs, ACCTYP(0:2), which 
identify the nature of the data transactions for read/write cycles. 

TAGV 

OUT 

Connection between cache and R3000. 

INT0-INT5 

IN 

Interrupt inputs. Each has a 10K ohm pullup load resistor. 

EXTOSC 

IN 

External oscillator input (Needed when using on-board delay line) 

ECLKHPI 



ECLKRD SMP 



ECLKSYS 

IN 

The 2x Clock inputs for R3000 and R3010. Timings must conform to R3000 specifications. 

(Needed when not using on board delay line) 

MRES# 

IN 

Negative-true master reset input. Connects directly to R3000 RES# input pin. 

BSYSOUTA- 

OUT 

Buffered clock outputs for synchronizing external events. 

BSYSOUTD 


Each of these is an identical clock signal, representing the inverted form of the R3000 output, 
SYSOUT#. 

BSYSOUT# 

OUT 

Buffered R3000 clock output, SYSOUT# for synchronizing external events. Non-inverted form of 
SYSOUT#. 

MEMRD# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory read cycle is in progress. 

MEMWR# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory write cycle is in progress. 

RBSY 

IN 

Positive-true input used to request a memory read stall initiation and termination. This signal is 
normally held in its asserted state and deasserted at the completion of stalls. 

WBSY# 

IN 

Negative-true input used to request a busy indication for subsequent memory write operations. 

CPCO 

IN 

Direct input to the R3000, used to indicate the size of the data (block, word, byte, or other) for 
memory read cycles. 

CPC1 

OUT 

Connection between the R3000 Processor and the R3010 FPA, indicating the status of the 
conditional branch. This pin is provided for diagnostic purposes, only. 

CPC2, CPC3 

I/O 

Direct connections to R3000 pins. 

EXC# 

OUT 

Direct output from R3000, indicating the EXC# signal between the R3000 and the R3010. 

RUN# 

OUT 

Negative-true output from the R3000, indicating that the R3000 is in its RUN state (not stalled). 

BERR# 

IN 

Negative-true input to R3000, used to indicate a bus error in main memory. 

FPA# 

OUT 

Negative-true output indicating the presence of R3010 FPA on the module. 

XEN# 

OUT 

Direct negative true output from the R3000. Used for read buffers output enable. 

FPINT# 

OUT 

Negative-true R3010 interrupt request. 





















































































IDT7RS103 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

l-cache 

D-cache 

Speed 

7RS103N44A16A 

R3000A 

NONE 

16K 

16K 

16 MHz 

7RS 1 03N44A20A 

R3000A 

NONE 

16K 

16K 

20 MHz 

7RS 1 03 N44A25 A 

R3000A 

NONE 

16K 

16K 

25 MHz 

7RS103F44A16A 

R3000A 

R3010A 

16K 

16K 

16 MHz 

7RS103F44A20A 

R3000A 

R3010A 

16K 

16K 

20 MHz 

7RS103F44A25A 

R3000A 

R3010A 

16K 

16K 

25 MHz 

7RS103N55A16A 

R3000A 

NONE 

32K 

32K 

16 MHz 

7RS103N55A20A 

R3000A 

NONE 

32K 

32K 

20 MHz 

7RS 1 03N55A25 A 

R3000A 

NONE 

32K 

32K 

25 MHz 

7RS103F55A16A 

R3000A 

R3010A 

32K 

32K 

16 MHz 

7RS 1 03F55A20 A 

R3000A 

R3010A 

32K 

32K 

20 MHz 

7RS103F55A25A 

R3000A 

R3010A 

32K 

32K 

25 MHz 

7RS103N66A16A 

R3000A 

NONE 

64K 

64K 

16 MHz 

7RS103N66A20A 

R3000A 

NONE 

64 K 

64K 

20 MHz 

7RS1 03N66A25A 

R3000A 

NONE 

64K 

64K 

25 MHz 

7RS103N66A16A 

R3000A 

R3010A 

64K 

64K 

16 MHz 

7RS1 03N66A20A 

R3000A 

R3010A 

64K 

64K 

20 MHz 

7RS1 03N66A25A 

R3000A 

R3010A 

64K 

64K 

25 MHz 


MORE INFORMATION 

For more information on this module, ask your IDT sales office for the Technical Specification and User's Manual. 
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IDT7RS104 

R3001 RISController™ ENGINE 


PRODUCT BRIEF 


BLOCK DIAGRAM 
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IDT7RS104 

R3001 RISController™ ENGINE PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal Name 

Type 

Description 

STANDARD LocalBus SIGNALS 

MD00-MD31 

I/O 

Memory data lines to/from main memory system. 

MA00-MA31 

I/O 

Memory address lines to main memory system. These are registered inputs/outputs. 

LRD# 

OUT 

Negative-true output which indicates that a memory read cycle is in progress. 

LWR# 

OUT 

Negative-true output which indicates that a memory write cycle is in progress. 

RACK# 

IN 

Negative-true input which is used to indicate that the main memory read cycle initiated by the 
IDT7RS105 has been completed. 

WACK# 

IN 

Negative-true input which is used to indicate that the main memory write cycle initiated by the 
IDT7RS105 has been completed. 

BE0#-BE3# 

OUT 

Negative-true output which indicates which byte is being accessed during main memory read or 
write operations. These four signals are valid only when LRD# or LWR# istrue. 

UINTO 

IN 

User interrupt input. Has a 10K ohm pullup load resistor. 

UINT1 

IN 

Interrupt input which is driven by the R3010 Floating Point Accelerator on the standard module. 

Has a 10K ohm pullup load resistor. 

UINT2 

IN 

User interrupt input. Has a 10K ohm pullup load resistor. 

UINT3 

IN 

Interrupt input which is driven by the 8254 Counter/Timer on the standard module. Has a 10K ohrr 
pullup load resistor. 

UINT4 

IN 

Interrupt input which is driven by the 8254 Counter/Timer on the standard module. Has a 10K ohrr 
pullup load resistor. 

UINT5 

IN 

Interrupt input which is driven by the SN2681 DUART on the standard module. Has a 10K ohm 
pullup load resistor. 

R3KOSCIN 

IN 

Oscillator input clock signal for the R3000/R3010. (2x clock rate). 

BSYSOUT# 

OUT 

Buffered clock outputs for synchronizing external events. This signal represents the non-inverted 
form of the R3000 output, SYSOUT#. 

BSYSCLK A-D 

OUT 

Buffered clock outputs for synchronizing external events. Each of these four outputs is an identical 
signal, representing the inverted form of the R3000 output SUSOUT#. 

MREQ# 

OUT 

Negative-true, one-clock cycle long output which indicates the start of a main memory read or write 
cycle. 

RESET# 

IN 

Negative-true master reset input. 

CPTO 

IN 

Indirect input to R3000 Processor. Transfers conditional branch status from an external 
coprocessor to the R3000. 

CPC1 

OUT 

Connection between the R3000 Processor and the R3010 FPA. Transfers conditional branch 
status from the R301 0 to the R3000. This pin is provided for diagnostic purposes only. 

CPC2-CPC3 

IN 

Direct inputs to the R3000 Processor. Transfers conditional branch status from external 
coprocessors to the R3000. 

NON-STANDARD LocalBus SIGNALS 

UARTCLKOUT 

OUT 

Clock output from a 3.6864 MHz oscillator on board the module. 

CLK8254 

IN 

Clock input to the 82C54 Interval Timer. May be shorted to UARTCLKOUT by the user. 

EXTCSUART# 

IN 

Negative-true input which is the chip select for the SN2681 DUART. 


List continued on following page 
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IDT7RS104 

R3001 RISController™ ENGINE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS (CONTINUED) 


EXTCSTIM# 

IN 

Negative-true input which is the chip select for the 8254 Counter/ Timer. 

INTCSUART# 

OUT 

Negative-true output of an on-board decoder. Decodes the UART address space as 0x1 FE00000- 
OxIFEOFFFF. May be shorted to EXTCSUART# by the user. 

INTCSTIM# 

OUT 

Negative-true output of an on-board decoder. Decodes the Interval Timer address space as 
Ox1FE40000-Qx1FE4FFFF. May be shorted to EXTCSTIM# by the user. 

DIRECT MEMORY ACCESS (DMA) CONTROL SIGNALS 

DICLK 

IN 

Latch enable to the instruction cache’s address latch which is used during DMAs. 

DDCLK 

IN 

Latch enable to the data cache’s address latch which is used during DMAs. 

DIWR# 

IN 

Negative-true input which is the write enable to the instruction cache RAMs. DIWR# is used 
during DMAs. 

DDWR# 

IN 

Negative-true input which is the write enable to the data cache RAMs. DDWR# is used during 
DMAs. 

DIRD# 

IN 

Negative-true input which is the output enable to the instruction cache RAMs. DIRD# is used 
during DMAs. 

DDRD# 

IN 

Negative-true input which is the output enable to the data cache RAMs. DDRD# is used during 
DMAs. 

TAGV 

IN 

Tag Valid input to the R3001. Has a 10K ohm pullup load resistor. 

UDMA 

IN 

Positive-true, buffered input to the R3001’s DMA pin. Has a 10K ohm pulldown load resistor. 

DCTL# 

IN 

Negative-true, buffered input which is the output enable for all of the DMA signals. Can be driven 
true only after UDMA becomes true. Also acts as the output enable for the R3001 side of the 
address buffers. Has a 10K ohm pullup load resistor. 

DAOE# 

IN 

Negative-true, buffered input which is the output enable to the main memory side of the address 
buffers. Can be used only when DCTL# is true. Has a 1 0K ohm pullupload resistor. 

DMAEN# 

IN 

Negative-true, buffered input which is the clock enable to the main memory side of the address 
buffers. Can be used only when DCTL# is true. 

DAEN# 

IN 

Negative-true, buffered input which is the clock enable to the R3001 side of the address buffers. 
Also act as the read data clock enable to the data buffers. Can only be used when DCTL# is true. 
Has a 10K ohm pullup load resistor. 

DWOE# 

IN 

Negatvie-true, buffered input which is the write data output enable to the data buffers. Can only be 
used when DCTL# is true. Has a 10K ohm pulldown load resistor. 

DXEN# 

IN 

Negative-true, buffered input which is the read data output enable to the data buffers. Can only be 
used when DCTL# is true. Has a 10K ohm pullup load resistor. 

DUART CONTROL SIGNALS 

RxDA 

IN 

Direct SCN2681 input. 

RxDB 

IN 

Direct SCN2681 input. 

TxDA 

OUT 

Direct SCN2681 output. 

TxDB 

OUT 

Direct SCN2681 output. 

IP0-IP1 

IN 

Direct SCN2681 inputs. 


IP2-IP6 


OPO-OP7 I OUT Direct SCN2681 outputs. 







































































IDT7RS104 

R3001 RISController™ ENGINE 


PRODUCT BRIEF 


RELATED PRODUCTS 
IDT/sIm 

The 7RS1 04 module includes IDT's monitor in EPROM on 
board. The IDT7RS901 System Integration Manager (IDT/ 
sim) is a ROMable software product that permits convenient 
control and debug of RISC systems built around the MIPS 
R3000 architecture. It permits users to quickly develop and 
debug stand-alone systems. Facilities are included to 
operate the CPU under controlled conditions, examining 
and altering the contents of memory, manipulating and 
controlling R3000 resources (such as cache, TLB and 
coprocessors), loading programs from host machines, and 
controlling the path of execution of loaded programs. Re- 
mote (source/symbolic) debugging is also supported. 
IDT/sim requires 82Kb of EPROM space for code and data 
and 16Kb of RAM space for uninitialized variable data and 
stack. The minimal I/O system supported uses UARTS. The 
default drivers support the 2681 or 68681 devices. Other 
devices can be added easily. 


Prototyping Platform 

A Prototyping Platform is in development for this product. 
Please contact your IDT sales office for latest status and 
technical information. 


ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

l-memory 

D-memory 

Speed 

Other 

7RS104F77A16A 

R3001 

R3010 

128KB 

128KB 

16 MHz 

EPROM socketed 

7RS104F77A20A 

R3001 

R3010 

128KB 

128KB 

20 MHz 

EPROM socketed 

7RS1 04F77A25A 

R3001 

R3010 

128KB 

128KB 

25 MHz 

EPROM socketed 


CUSTOM OPTIONS 

Most software features of the 7RS104 can be modified by special order. Contact sales office for details. 


MORE INFORMATION 

For more information on this module, ask your IDT sales office for the Technical Specification and User's Manual. 
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R3000 CPU MODULES 

FOR HIGH PERFORMANCE AND 

MULTIPROCESSOR SYSTEMS 


IDT7RS107 


FEATURES: 

• Cache Size: 64K Instruction, 64K Data 

• Processor Speeds up to 33 MHz 

• Includes R3010 Floating Point Accelerator 

• 1-word Read Buffer; 4-word Write Buffer 

• Supports R3000 Multiprocessor Features 

• Entire 1-Cache can be invalidated with external 
cache reset signal 

• Eight-word block refills 

• On-board oscillator, delay line, and reset 
circuitry 

• 100% burn-ln and functional test at rated speed 


R3000 MODULE FOR HIGH PERFORMANCE 
CPUs AND MULTIPROCESSOR SYSTEMS: 

The IDT7RS107 is a complete reduced instruction set 
computer (RISC) CPU, based on the MIPS R3000 RISC 
processor, and supplied on a small fully-tested high-density 
plug-in module. The module includes the R3000 CPU, the 
R3010 Floating Point Accelerator, 64 Kbytes each of data 
and instruction cache memory, a single word read buffer and 
a four-word write buffer. Clock generation, reset, control 
and interrupt functions are included on the module to sim- 
plify the remainder of the system design. 

The 107 module is designed to support the R3000’s 
multiprocessor features. Data in the D-cache can be invali- 
dated by the R3000 CPU. It is also possible to invalidate the 
entire contents of the l-cache in a single cycle by using an 
external cache reset signal. 

The module is constructed using surface mount devices 
on a 5.2" by 5.2" epoxy laminate board, and is connected to 
the user's system via 1 95 pins located in two pin row regions 
on the board. 



7RS1 07 Module. Actual Size 5.2" x 5.2 


RSD PB107/A/R 


DECEMBER 1990 


©1990 Integrated Device Technology, Inc. 
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IDT7RS107 

R3000 RISC CPU MODULE 


ARCHITECTURAL HIGHLIGHTS 
Uses R3020 Write Buffers 

R3020 chips are used on the module to provide a “smart" 
four-deep write buffer between the CPU and external 
memory. These devices store data and addresses for up to 
four write requests to main memory, and handle the hand- 
shaking with the memory controller. The R3020S support 
features such as byte gathering (combining multiple byte 
writes to the same address in the buffer into a single write) 
and address matching (a read or write to an address already 
in the write buffer will be detected so the user software can 
take appropriate action). The R3020's Match signals are 
OR'ed on the module to produce a single output, labeled 
CONFLICT. 

Resettable Instruction Cache 

The 7RS107 module permits invalidation of the entire 
instruction cache via a “cache reset” pin on the module. This 
feature is used to wipe the cache clean when the a block of 
instructions in main memory have been changed by a DMA 
operation. It is usually much faster than invalidating each 
affecting tag individually. 

Multiprocessor Invalidate In Data Cache 

The module supports the R3000’s multiprocessor cache 
invalidate feature, so that data cache coherency can be 
maintained when data held in the cache is altered externally. 
The R3000’s MP Stall and MP Invalidate signals are avail- 
able as pins on the module. The user’s system stalls the 
processor and then provides an address to the module while 
signaling MP Invalidate. The module stores the address in 
a latch and applies it to the cache at the right time for the 
R3000 to invalidate the referenced tag. 

Eight-Word Block Refill 

The module refills both the instruction and data caches 
from memory in eight-word blocks. Following a cache miss, 
the processor will request a memory read at the missed 
address and wait for a data ready acknowledgement. When 
an acknowledge is received, the processor will load eight 
words into cache on eight successive clock cycles. The 
memory interface must supply the correct eight words (ad- 
dress A4A3A2 = 0 to 7) at the processor's speed, 40 ns 
intervals for a 25 MHz system. Interleaved memory is 
usually the best way to support this requirement. The 
processor’s CPCO pin, available as a pin on the module, can 
be used to over-ride the block refill on data, but instructions 
refills must always be in 8-word blocks. The processor 
performs instruction streaming during the refill. 


PRODUCT BRIEF 


On-board Oscillator and Delay Line 

All the clock generation circuitry required by the R3000 
system is on the module. A jumper can be used to select 
between the on-board crystal oscillator or an external oscil- 
lator input. A delay line on the module is used to set the 
timing for register strobes and other critical signals relative 
to the R3000 clock. The R3000 clock output “SYSOUT” is 
made available to the user system through eight pins on the 
module, each independently buffered. 

R3000 Reset and Initialization Logic 

The initialization logic for the R3000 CPU is contained on 
the module. A “Cold Reset” pin on the module starts the 
required 15 ms reset signal to the CPU, and then provides 
the initialization vectors during the last few cycles. A second 
reset pin is provided to reinitialize the CPU without repeating 
the 15 ms delay. The R3000 is initialized to “Big-Endian” 
operation. 

Five User Interrupt Lines 

Five pins on the module are used for user interrupt inputs. 
The user interrupts are synchronized in registers on the 
module before being sent to the R3000. Interrupt 2 is used 
for the Floating Point Accelerator, if present. 

External R3000 Condition Code Pin 

The R3000 input CPCO is available as a pin on the 
module. During initialization, this pin is programmed as a 
Condition Code test pin, so the R3000 can do a Test and 
Branch in a single cycle based on its state. During read 
stalls, the pin determines whether a single word or 8 words 
will be read. Reads into the instruction cache must always 
be block refills. 

TYPICAL APPLICATIONS 

The 7RS107 module is designed for applications that run 
complex operating systems, such as UNIX™, or that need 
outside control of cache memory contents, such as multipro- 
cessor systems. 

The module supports the R3000’s ability to invalidate 
entries in the data cache, allowing multiple processor sys- 
tems to maintain cache coherency. 

The module is offered with the maximum possible cache 
sizes (64K each) that can be supported by the R3000 in a 
multiprocessor configuration. These sizes are well suited to 
running UNIX at very high instruction rates as well. 

The R3020 Write Buffer is used to provide a four-word 
deep write buffer, which is ideal for most UNIX systems. 
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IDT7RS107 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 
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IDT7RS107 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal Name Type 



BACTO.1,2 


MDP0...MDP3 


CP_CpCondO, 2, 3 



WB_OutEn 


WB_Request 


WB_AcknowIedge 



Description 


32-bit address from the module to external memory. This is an output from the 3020 Write Buffer 
except during the MP Invalidate function, when it is the input to the MP cache address latch. 


32-bit data bus between the module and external memory. Driven from the 3020 Write Buffer during 
writes; input to the Read Data Buffer during reads. 


The three R3000 AccType status signals, driven from the 3020 Write Buffer during writes and from 
a latch during reads. 


The four parity bits for the MD data. Output during writes and input during reads. 


The three flag inputs to the R3000 CPU. CPC0 is used during read stalls to control block refill of the 
data cache. (The instruction cache must always be block refilled.) CPC2 and CPC3 are the MP stall 
and invalidate controls. 


Data Cache Address Latch Output Enable When LOW, enables the output of the latch holding the 
data cache address supplied by the R3000. It should be LOW at all times except when the MP Latch 
is being used to invalidate a cache address. 


Data Cache MP Address Latch Output Enable. This input is used to enable the output of the latch 
holding the address supplied by the user system during an MP stall cycle. It should be enabled (LOW) 
only during the MP invalidate operation. 


Eight buffered inverted copies of the R3000 signal "SYSOUT” for use in the user’s system. 


Interrupt inputs to the R3000. These signals are synchronized to SYSOUT on the module. R3000 
interrupt 2 is used for the Floating Point Accelerator. 


Buffered copy of the reset signal created on the module to reset the CPU. LOW during Reset. 


Write Busy. Status signal created by the R3020 write buffer. Goes LOW to indicate the buffer is full. 


Input to the R3000 indicating a bus error has occurred. 


Cold Reset to the module. The module creates a 1 5 ms long reset to the R3000 and executes the 
R3000 initialization sequence when this pin goes LOW. 


This signal can be used to detect the presence of an FPA on the module. To be used, it must be 
connected to a 4.7K pullup resistor. The pin will be LOW if the FPA is present. 


Active LOW asynchronous clear to the 1-Cache Tag RAMS. Sets the entire 1-Cache invalid. 


Write Buffer Output Enable. When LOW, turns on the outputs of the R3020 write buffers. 


Output from the R3020 to indicate that there is data in the buffer to be written to memory. Active LOW 


Input to the R3020 to indicate data has been written into memory. 


The OR of all the R3020 Match signals; indicates the address on the R3020 inputs matches one of 
the addresses currently in the write buffer. 


Read Address Buffer Output Enable. When LOW, turns on outputs of the buffers containing the read 
address. 


Read Data Buffer Clock Enable. When LOW, enables the clock (SYSOUT) to the Read Data Buffers. 


Status signal output. LOW during reads. 


Read Address Buffer Latch Enable. When HIGH, enables the Read Address Buffer latches. 


Latch Error Address input to the R3020. 


Enable Error Address input to R3020. 


R3000 output signal. When LOW, there is a request for a read from external memory. 


Read Busy. Input to the R3000 to indicate acknowledgment of the MEMRD request. 


Additional Reset command. Same as RESETC, but does not go through the 15 ms delay. Can be 
used to re-initialize the R3000 when power is on. 










































































IDT7RS107 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


RELATED PRODUCTS 


Prototyping System 

The 7RS107 module can be placed into immediate ser- 
vice using our flexible 7RS307 Prototyping Platform. The 
system includes two boards: a general purpose CPU board, 
and a personality card that interfaces the module to the C PU 
board. 

The CPU board contains 1Mb of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, 
and an 8-bit parallel port accessible through a dual port 
RAM. Four 50-pin connectors provide access to all the 
address, data, and control signals for external connection to 
additional hardware on, for example, a wire-wrap board. 


The system includes IDT’s Software Integration Man- 
ager, which provides facilities for downloading code, exam- 
ining memory, and stepping through programs. 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and con- 
nectors for an HP 16500 Logic Analyzer. 

Code for the R3000 can be created on a MIPS develop- 
ment system, on IDT's MacStation™ system, or using IDT’s 
PC-based cross assembler and compiler products. As- 
sembled code can be downloaded into the Prototyping 
System for execution and debug. 



A Module Prototyping Platform. 

The card on the left is the personality card with a module; the card on the right is the general purpose CPU. 
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IDT7RS107 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

i-cache 

D- cache 

Speed 

Other 

7RS107N66A16A 

R3000A 

NONE 

64K 

64K 

16 MHz 


7RS107N66A20A 

R3000A 

NONE 

64K 

64 K 

20 MHz 


7RS107N66A25A 

R3000A 

NONE 

64K 

64K 

25 MHz 


7RS1 07N66A33A 

R3000A 

NONE 

64K 

64K 

33 MHz 


7RS107F66A16A 

R3000A 

R3010A 

64K 

64K 

16 MHz 


7RS1 07F66A20A 

R3000A 

R3010A 

64K 

64 K 

20 MHz 


7RS107F66A25A 

R3000A 

R3010A 

64K 

64 K 

25 MHz 


7RS1 07F66A33A 

R3000A 

R301 0A 

64K 

64K 

33 MHz 



CUSTOM OPTIONS 

Some features of the 7RS1 07 can be modified by special 
order. Contact your IDT sales office for details. 

Software modifications include: initialization mode forthe 
R3000, endian option, size of block refill, instruction streaming 
option. 


Manufacturing options include pin length, style, and plat- 
ing; special marking; additional burn-in, and socketing of the 
CPU and/or FPA. 
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Integrated Device Technology, Inc. 


R3000 CPU MODULES 
WITH 256K CACHES 


IDT7RS1 08 


FEATURES: 

• Cache Size: 256KB Instruction, 256KB Data 

• Processor Speeds up to 25 MHz 

• Includes R3010 Floating Point Accelerator 

• 1-word Read Buffer; 4-word Write Buffer 

• Eight-word block refills 

• On-board Parity Generation and Check 

• On-board oscillator, delay line, and reset 
circuitry 

• 100% burn-in and functional test at rated speed 


R3000 MODULE FOR HIGH PERFORMANCE 
CPUs: 

The IDT7RS108 is a complete reduced instruction set 
computer (RISC) CPU, based on the MIPS R3000 RISC 
processor, and supplied on a small fully-tested high-density 
plug-in module. The module includes the R3000 CPU, the 
R3010 Floating Point Accelerator, 256 Kbytes each of data 
and instruction cache memory, a single word read buffer and 
a four-word write buffer. 

Clock generation, reset, parity, control and interrupt func- 
tions are included on the module to simplify the remainder 
of the system design. 

The 7RS108 module is pin compatible with the 7RS107 
and 7RS109 modules (with 64K caches), but does not 
support the multiprocessing features offered by those 
modules. 

The module is constructed using surface mount devices 
on a 5.2" by 5.2” epoxy laminate board, and is connected to 
the user’s system via 1 95 pins located in two pin row regions 
on the board. 



7RS1 08 Module. Actual Size 5.2" x 5.2" 


RSD PB1 08/A/I 


DECEMBER 1990 


©1990 Integrated Device Technology. Inc. 
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IDT7RS108 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 
Uses R3020 Write Buffers 

R3020 chips are used on the module to provide a “smart” 
four-deep write buffer between the CPU and external 
memory. These devices store data and addresses for up to 
four write requests to main memory, and handle the hand- 
shaking with the memory controller. The R3020s support 
features such as byte gathering (combining multiple byte 
writes to the same address in the buffer into a single write) 
and address matching (a read orwrite to an address already 
in the write buffer will be detected so the user software can 
take appropriate action). The R3020’s Match signals are 
OR'ed on the module to produce a single output, labeled 
CONFLICT. 

Eight-Word Block Refill 

The module refills both the instruction and data caches 
from memory in eight-word blocks. Following a cache miss, 
the processor will request a memory read at the missed 
address and wait for a data ready acknowledgement. When 
an acknowledge is received, the processor will load eight 
words into cache on eight successive clock cycles. The 
memory interface must supply the correct eight words (ad- 
dress A4A3A2 = 0 to 7) at the processor’s speed, 40 ns 
intervals for a 25 MHz system. Interleaved memory is 
usually the best way to support this requirement. The 
processor's CPCO pin, available as a pin on the module, can 
be used to over-ride the block refill . The processor performs 
instruction streaming during the refill. 

On-board Oscillator and Delay Line 

All the clock generation circuitry required by the R3000 
system is on the module. A jumper can be used to select 
between the on-board crystal oscillator or an external oscil- 
lator input. A delay line on the module is used to set the 
timing for register strobes and other critical signals relative 
to the R3000 clock. The R3000 clock output “SYSOUT” is 
made available to the user system through eight pins on the 
module, each independently buffered. 


R3000 Reset and Initialization Logic 

The initialization logic forthe R3000 CPU is contained on 
the module. A “Cold Reset” pin on the module starts the 
required 15 ms reset signal to the CPU, and then provides 
the initialization vectors during the last few cycles. A second 
reset pin is providedto reinitialize the CPUwithout repeating 
the 15 ms delay. The R3000 is initialized to "Big-Endian" 
operation. 

Five User Interrupt Lines 

Five pins on the module are used for user interrupt inputs. 
The user interrupts are synchronized in registers on the 
module before being sent to the R3000. Interrupt 2 is used 
for the Floating Point Accelerator, if present. 

External R3000 Condition Code Pin 

The R3000 input CPCO is available as a pin on the 
module. During initialization, this pin is programmed as a 
Condition Code test pin, so the R3000 can do a Test and 
Branch in a single cycle based on its state. During read 
stalls, the pin determines whether a single word or 8 words 
will be read. Reads into the instruction cache must always 
be block refills. 

TYPICAL APPLICATIONS 

The 7RS108 module is designed for applications that run 
complex operating systems, such as UNIX™, or that need 
the maximum possible performance. The module contains 
the maximum possible cache sizes (256K each) that can be 
supported by the R3000. 
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IDT7RS108 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


FUNCTIONAL BLOCK DIAGRAM 


CPU Controls 



& Parity 
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IDT7RS108 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal Name 

Type 

Description 

MA0...MA31 

I/O 

32-bit address from the module to external memory. This is an output from the 3020 Write Buffer 
except during the MP Invalidate function, when it is the input to the MP cache address latch. 

MD0...MD31 

I/O 

32-bit data bus between the module and external memory. Driven from the 3020 Write Buffer during 
writes; input to the Read Data Buffer during reads. 

BACTO.1,2 

o 

The three R3000 AccType status signals, driven from the 3020 Write Buffer during writes and from 
a latch during reads. 

MDP0...MDP3 

I/O 

The four parity bits for the MD data. Output during writes and input during reads. 

CP_CpCondO, 2, 3 

1 

The three flag inputs to the R3000 CPU. CPC0 is used during read stalls to control block refill of the 
data cache. (The instruction cache must always be block refilled.) CPC2 and CPC3 are the MP stall 
and invalidate controls. 

ALOE 

■ 

Data Cache Address Latch Output Enable When LOW, enables the output of the latch holding the 
data cache address supplied by the R3000. It should be LOW at all times except when the MP Latch 
is being used to invalidate a cache address. 

BSYSOUT2...9 

o 

Eight buffered inverted copies of the R3000 signal “SYSOUT for use in the user’s system. 

UINTO.1,3,4,5 

1 

Interrupt inputs to the R3000. These signals are synchronized to SYSOUT on the module. R3000 
interrupt 2 is used for the Floating Point Accelerator. 

BRESET 

o 

Buffered copy of the reset signal created on the module to reset the CPU. LOW during Reset. 

WB_WbFull 

0 

Write Busy. Status signal created by the R3020 write buffer. Goes LOW to indicate the buffer is full. 

CPU_BusError 

1 

Input to the R3000 indicating a bus error has occurred. 

RESETC 

1 

Cold Reset to the module. The module creates a 15 ms long reset to the R3000 and executes the 
R3000 initialization sequence when this pin goes LOW. 

FP_FpPresent 

o 

This signal can be used to detect the presence of an FPA on the module. To be used, it must be 
connected to a 4.7K pullup resistor. The pin will be LOW if the FPA is present. 

WB_OutEn 

1 

Write Buffer Output Enable. When LOW, turns on the outputs of the R3020 write buffers. 

WB_Request 

o 

Output from the R3020 to indicate that there is data in the buffer to be written to memory. Active LOW 

WB_Acknowledge 

1 

Input to the R3020 to indicate data has been written into memory. 

CONFLICT 

o 

The OR of all the R3020 Match signals; indicates the address on the R3020 inputs matches one of 
the addresses currently in the write buffer. 

RABOE 

1 

Read Address Buffer Output Enable. When LOW, turns on outputs of the buffers containing the read 
address. 

RDBCE 

1 

Read Data Buffer Clock Enable. When LOW, enables the clock (SYSOUT) to the Read Data Buffers. 

READ 

o 

Status signal output. LOW during reads. 

RABLE 

1 

Read Address Buffer Latch Enable. When HIGH, enables the Read Address Buffer latches. 

WB_LatchErrAddr 

1 

Latch Error Address input to the R3020. 

WB_EnErrAddr 

1 

Enable Error Address input to R3020. 

CP_MemRd 

o 

R3000 output signal. When LOW, there is a request for a read from external memory. 

CP_RdBusy 

1 

Read Busy. Input to the R3000 to indicate acknowledgment of the MEMRD request. 




































































IDT7RS108 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


RELATED PRODUCTS 

Prototyping System 

The 7RS108 module can be placed into immediate ser- 
vice using our flexible 7RS308 Prototyping Platform. The 
system includes two boards: a general purpose CPU board, 
and a personality card that interfaces the module to the CPU 
board. 

The CPU board contains 1Mb of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, 
and an 8-bit parallel port accessible through a dual port 
RAM. Four 50-pin connectors provide access to all the 
address, data, and control signals for external connection to 
additional hardware on, for example, a wire-wrap board. 


The system includes IDT’s Software Integration Man- 
ager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and con- 
nectors for an HP 16500 Logic Analyzer. 

Code for the R3000 can be created on a MIPS develop- 
ment system, on IDT’s MacStation™ system, or using IDT’s 
PC-based cross assembler and compiler products. As- 
sembled code can be downloaded into the Prototyping 
System for execution and debug. 



A Module Prototyping Platform. 

The card on the left Is the personality card with a module; the card on the right Is the general purpose CPU. 
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IDT7RS108 

R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

1-Cache 

D-Cache 

Speed 

Other 

7RS108N88A16A 

R3000A 

NONE 

256K 

256K 

16 MHz 


7RS1 08N88A20A 

R3000A 

NONE 

256K 

256K 

20 MHz 


7RS1 07N88A25A 

R3000A 

NONE 

256K 

256K 

25 MHz 


7RS108F88A16A 

R3000A 

R3010A 

256K 

256K 

16 MHz 


7RS108F88A20A 

R3000A 

R3010A 

256K 

256K 

20 MHz 


7RS108F88A25A 

R3000A 

R3010A 

256K 

256K 

25 MHz 



For detailed design information, contact IDT and ask for the 7RS108 Technical Specification. 


CUSTOM OPTIONS 

Some features of the 7RS108 can be modified by special order. Contact your IDT sales office for details. 

Software modifications include: initialization mode forthe R3000, endian option, size of block refill, instruction streaming 
option. 

Manufacturing options include pin length, style, and plating; special marking; additional burn-in, and socketing of the 
CPU and/or FPA. 












R3000 CPU MODULES 


ADVANCE 

INFORMATION 

IDT7RS109 



FEATURES: 

Cache Size: 64K Instruction, 64K Data 

Processor Speeds up to 33 MHz 

Includes R3010 Floating Point Accelerator 

1-word Read Buffer; 4-word Write Buffer 

Supports R3000 Multiprocessor Features 

On-Board Parity Check and Generate 

Eight-word block refills 

On-board oscillator, delay line, and reset 
circuitry 

100% burn-in and functional test at rated speed 


R3000 MODULE FOR HIGH PERFORMANCE 
CPUs AND MULTIPROCESSOR SYSTEMS: 

The IDT7RS109 is a complete reduced instruction set 
computer (RISC) CPU, based on the MIPS R3000 RISC 
processor, and supplied on a small fully-tested high-density 
plug-in module. The module includes the R3000 CPU, the 
R3010 Floating Point Accelerator, 64 Kbytes each of data 
and instruction cache memory, a single word read bufferand 
a four-word write buffer. Clock generation, reset, control, 
parity, and interrupt functions are included on the module to 
simplify the remainder of the system design. 

The 109 module includes a latch to hold an external 
address for snooping in the D-cache and is designed to 
support the R3000’s multiprocessor features. 

The module is constructed using surface mount devices 
on a 5.2" by 5.2" epoxy laminate board, and is connected to 
the user’s system via 1 95 pins located in two pin row regions 
on the board. 



7RS1 09 Module. Actual Size 5.2" x 5.2 


RSD PB109/A/R 


DECEMBER 1990 

©1990 Integrated Device Technology, Inc. 
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R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 
Uses R3020 Write Buffers 

R3020 chips are used on the module to provide a “smart” 
four-deep write buffer between the CPU and external 
memory. These devices store data and addresses for up to 
four write requests to main memory, and handle the hand- 
shaking with the memory controller. The R3020s support 
features such as byte gathering (combining multiple byte 
writes to the same address in the buffer into a single write) 
and address matching (a read orwrite to an address already 
in the write buffer will be detected so the user software can 
take appropriate action). The R3020's Match signals are 
OR'ed on the module to produce a single output, labeled 
CONFLICT. 

Eight-Word Block Refill 

The module refills both the instruction and data caches 
from memory in eight-word blocks. Following a cache miss, 
the processor will request a memory read at the missed 
address and wait for a data ready acknowledgement. When 
an acknowledge is received, the processor will load eight 
words into cache on eight successive clock cycles. The 
memory interface must supply the correct eight words (ad- 
dress A4A3A2 = 0 to 7) at the processor's speed, 40 ns 
intervals for a 25 MHz system. Interleaved memory is 
usually the best way to support this requirement. The 
processor’s CPCO pin, available as a pin on the module, can 
be used to over-ride the block refill. The processor performs 
instruction streaming during the refill. 

On-board Oscillator and Delay Line 

All the clock generation circuitry required by the R3000 
system is on the module. A jumper can be used to select 
between the on-board crystal oscillator or an external oscil- 
lator input. A delay line on the module is used to set the 
timing for register strobes and other critical signals relative 
to the R3000 clock. The R3000 clock output “SYSOUT" is 
made available to the user system through eight pins on the 
module, each independently buffered. 

R3000 Reset and Initialization Logic 

The initialization logic for the R3000 CPU is contained on 
the module. A “Cold Reset” pin on the module starts the 
required 15 ms reset signal to the CPU, and then provides 
the initialization vectors during the last few cycles. A second 
reset pin is provided to reinitialize the CPU without repeating 
the 15 ms delay. The R3000 is initialized to “Big-Endian” 
operation. 


Five User Interrupt Lines 

Five pins on the module are used for user interrupt inputs. 
The user interrupts are synchronized in registers on the 
module before being sent to the R3000. Interrupt 2 is used 
for the Floating Point Accelerator, if present. 

External R3000 Condition Code Pin 
The R3000 input CPCO is available as a pin on the 
module. During initialization, this pin is programmed as a 
Condition Code test pin, so the R3000 can do a Test and 
Branch in a single cycle based on its state. During read 
stalls, the pin determines whether a single word or 8 words 
will be read. Reads into the instruction cache must always 
be block refills. 

Internal Parity Check and Generate 

The 7RS109 uses IDT 7321 1 registers for the data read 
buffer. This device provides the ability to generate parity on 
incoming data, if it is not already present, or to check parity 
on outgoing data to detect parity errors occuring on the 
module. 

TYPICAL APPLICATIONS 

The 7RS1 09 module is designed for applications that run 
complex operating systems, such as UNIX™, or that need 
the maximum possible performance. The module contains 
the maximum possible cache sizes (64K each) that can be 
supported by the R3000 in Multiprocessor configurations. 
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FUNCTIONAL BLOCK DIAGRAM 
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R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal Name 

Type 

Description 

MA0...MA31 

I/O 

32-bit address from the module to external memory. This is an output from the 3020 Write Buffer 
except during the MP Invalidate function, when it is the input to the MP cache address latch. 

MD0...MD31 

I/O 

32-bit data bus between the module and external memory. Driven from the 3020 Write Buffer during 
writes; input to the Read Data Buffer during reads. 

BACT0.1 ,2 

o 

The three R3000 AccType status signals, driven from the 3020 Write Buffer during writes and from 
a latch during reads. 

MDP0...MDP3 

I/O 

The four parity bits for the MD data. Output during writes and input during reads. 

CP_CpCondO, 2, 3 

■ 

The three flag inputs to the R3000 CPU. CPC0 is used during read stalls to control block refill of the 
data cache. (The instruction cache must always be block refilled.) CPC2 and CPC3 are the MP stall 
and invalidate controls. 

ALOE 

■ 

Data Cache Address Latch Output Enable When LOW, enables the output of the latch holding the 
data cache address supplied by the R3000. It should be LOW at all times except when the MP Latch 
is being used to invalidate a cache address. 

BSYSOUT2...9 

o 

Eight buffered inverted copies of the R3000 signal “SYSOUT” for use in the user’s system. 

UINT0.1 ,3,4,5 

1 

Interrupt inputs to the R3000. These signals are synchronized to SYSOUT on the module. R3000 
interrupt 2 is used for the Floating Point Accelerator. 

BRESET 

o 

Buffered copy of the reset signal created on the module to reset the CPU. LOW during Reset. 

WB_WbFull 

o 

Write Busy. Status signal created by the R3020 write buffer. Goes LOW to indicate the buffer is full. 

CPU_BusError 

1 

Input to the R3000 indicating a bus error has occurred. 

RESETC 


Cold Reset to the module. The module creates a 15 ms long reset to the R3000 and executes the 
R3000 initialization sequence when this pin goes LOW. 

FP_FpPresent 

o 

This signal can be used to detect the presence of an FPA on the module. To be used, it must be 
connected to a 4.7K pullup resistor. The pin will be LOW if the FPA is present. 

WB_OutEn 

1 

Write Buffer Output Enable. When LOW, turns on the outputs of the R3020 write buffers. 

WB_Request 

o 

Output from the R3020 to indicate that there is data in the buffer to be written to memory. Active LOW 

WB_Acknowledge 

1 

Input to the R3020 to indicate data has been written into memory. 

CONFLICT 

o 

The OR of all the R3020 Match signals; indicates the address on the R3020 inputs matches one of 
the addresses currently in the write buffer. 

RABOE 

1 

Read Address Buffer Output Enable. When LOW, turns on outputs of the buffers containing the read 
address. 

RDBCE 

1 

Read Data Buffer Clock Enable. When LOW, enables the clock (SYSOUT) to the Read Data Buffers. 

READ 

o 

Status signal output. LOW during reads. 

RABLE 

1 

Read Address Buffer Latch Enable. When HIGH, enables the Read Address Buffer latches. 

WBJLatchErrAddr 

1 

Latch Error Address input to the R3020. 

WB_EnErrAddr 

1 

Enable Error Address input to R3020. 

CP_MemRd 

o 

R3000 output signal. When LOW, there is a request for a read from external memory. 

CP_RdBusy 

1 

Read Busy. Input to the R3000 to indicate acknowledgment of the MEMRD request. 
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R3000 RISC CPU MODULE 


PRODUCT BRIEF 


RELATED PRODUCTS 
Prototyping System 

The 7RS109 module can be placed into immediate ser- 
vice using our flexible 7RS309 Prototyping Platform. The 
system includes two boards: a general purpose CPU board, 
and a personality card that interfaces the module to the CPU 
board. 

The CPU board contains 1 Mb of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, 
and an 8-bit parallel port accessible through a dual port 
RAM. Four 50-pin connectors provide access to all the 
address, data, and control signals for external connection to 
additional hardware on, for example, a wire-wrap board. 


The system includes IDT’s Software Integration Man- 
ager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and con- 
nectors for an HP 16500 Logic Analyzer. 

Code for the R3000 can be created on a MIPS develop- 
ment system, on IDT's MacStation™ system, orusing IDT's 
PC-based cross assembler and compiler products. As- 
sembled code can be downloaded into the Prototyping 
System for execution and debug. 



A Module Prototyping Platform. 

The card on the left is the personality card with a module; the card on the right Is the general purpose CPU. 
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ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

l-cache 

D-cache 

Speed 

Other 

7RS109N66A16A 

R3000A 

NONE 

64K 

64K 

16 MHz 


7RS109N66A20A 

R3000A 

NONE 

64K 

64K 

20 MHz 


7RS1 07N66A25A 

R3000A 

NONE 

64K 

64K 

25 MHz 


7RS107N66A33A 

R3000A 

NONE 

64K 

64K 

33 MHz 


7RS109F66A16A 

R3000A 

R3010A 

64K 

64K 

16 MHz 


7RS109F66A20A 

R3000A 

R3010A 

64K 

64K 

20 MHz 


7RS109F66A25A 

R3000A 

R3010A 

64K 

64K 

25 MHz 


7RS109F66A33A 

R3000A 

R3010A 

64K 

64K 

33 MHz 



CUSTOM OPTIONS 

Some features of the 7RS1 09 can be modified by special 
order. Contact your IDT sales office for details. 

Software modifications include: initialization mode forthe 
R3000, endian option, size of block refill, instruction stream- 
ing option. 


Manufacturing options include pin length, style, and plat- 
ing; special marking; additional burn-in, and socketing of the 
CPU and/or FPA. 
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R3000 CPU MODULES 

IDT7RS110 

Integrated Device Technology, Inc. 

1 




FEATURES: 

• Choice of Cache Sizes: 32 I and D or 32K I and 
16K D 

• Small size: 2.9" x 4.1" 

• Processor Speeds up to 33 MHz 

• Optional R3010 Floating Point Accelerator 

• Single word Read and Write Buffers 

• On-Board Clock, Reset, and Parity Logic 

• 100% burn-in and functional test at rated speed 


R3000 MODULE FOR GENERAL USE IN 
SMALL SYSTEMS: 

The IDT7RS110 is a complete reduced instruction set 
computer (RISC) Subsystem, based on the MIPS R3000 
RISC processor, and supplied on a small fully-tested high- 
density plug-in module. There are two versions of the 
module. One version contains 32 Kbytes each of data and 
instruction cache memory, and the other 32 Kbytes of 
instruction and 16 Kbytes of data cache. Both include an 
R3000 Instruction Set CPU, a single word read buffer, a 
single word write buffer and, optionally, the R301 0 Floating 
Point Accelerator. 

Cache misses are handled with single word read re- 
quests to memory, providing a simple interface to any type 
of main memory system. 

The module is constructed using surface mount devices 
on a two-sided epoxy laminate board, and is connected to 
the user’s system via six rows of 36 pins each. 
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R3000 RISC CPU MODULE 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 

Interchangeable Performance Options 

The 7RS1 1 0 is available in two cache size choices and in 
a variety of speed selections. All versions plug into the same 
footprint, creating a simple way to offer price/performance 
selections for RISC systems. 

R3010 Floating Point Accelerator 

The R3010 Floating Point Accelerator (FPA) is included 
as an integral part of the module. It operates in conjunction 
with the R3000 RISC Processor and greatly improves the 
system performance by expanding the instruction set to 
include very fast floating point capabilities. All timing and 
control connections are on the module. The R3010 can use 
either INTI or INT3 to the R3000 (jumper selectable). 

On-Board Clock Generation 

The 7RS1 1 0 includes on board oscillator and delay line to 
provide the high speed clocks to the R3000 components. 
The R3000 output signal SYSOUT is buffered and four 
copies are brought off the board for use as a system clock. 

Cache Memory 

Cache memory is provided on the module for 32 Kbytes 
of instruction cache and either 32K bytes or 16K bytes for 
data cache. Memory operations which require main memory 
data transfers are conveniently handled by means of on- 
board control signals. 

Cache read miss operations are handled as single-word 
fetch and cache update. Non-cache read operations (such 
as I/O reads) are indicated by control signals. 


Address and Data Buffers 

The address and data lines coming out of the module are 
buffered and can support substantial drive requirements. 
The Address Buffer is a bidirectional register, permitting 
cache snooping for multiprocessor systems. 

The data pins are also driven by bidirectional registers. 
Memory write cycles utilize a single-word write buffer on the 
module which permits the R3000 Processor to continue 
running while data is being written into main memory. 

R3000 User Interrupts 

Six user interrupt inputs are provided, INT0-INT5. Each 
of these is a negative-true signal, terminated with a 1 0K ohm 
pullup load resistor on the module. The pins may be left 
unconnected if not used. The interrupt signals are syn- 
chronized in a PAL before being applied to the interrupt pins 
of the R3000 Processor. 
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FUNCTIONAL BLOCK DIAGRAM 


CPU Controls 
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R3000 RISC CPU MODULE 


PRODUCT BRIEF 


SIGNALS PROVIDED ON MODULE PINS 


Signal Name 

Type 

Description 

MD00-MD31 

I/O 

Registered memory data lines to/from main memory system. 

MDP0-MDP3 

I/O 

Parity bits for data lines. 

MA00-MA31 

I/O 

Registered address output from the R3000 when used in a single-processor system. When used in 
a multi-processing system the bus is bi-directional. 

MACT0-MACT1 . 

OUT 

Positive-true outputs indicating the states of the R30C0 outputs, 

ACCTYP2 


ACCTYP(0:2), which identify the nature of the data transactions for read/write cycles. MACT(0:1 ) are 
the registered ACCTYP(0:1) lines. 

UINT0#-UINT5# 

IN 

Interrupt inputs. 

EXTCLK 

IN 

External oscillator input (Needed when selecting external clock source). 

CLKSEL 

IN 

Clock source selector. Should be pulled down to select external clock source. 

MRES# 

IN 

Negative-true master reset input. 

RES# 

OUT 

Negative-true reset output, connects directly to R3000 RES# pin. 

SYSOUTA-D 

OUT 

Buffered clock outputs for synchronizing external events. Each of these is an identical clock signal, 
representing the inverted form of the R3000 output, SYSOUT#. 

MEMRD# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory read cycle is in progress. 

MEMWR# 

OUT 

Direct negative-true output from R3000, used to indicate that a memory write cycle is in progress. 

RBSY 

|y| 

Positive-true input used to request a memory read stall initiation and termination. This signal is 
normally held in its asserted state and deasserted at the completion of stalls. 

WBSY# 

IN 

Negative-true input used to request a busy indication for subsequent memory write operations. 

CPCO 

IN 

Direct input to the R3000, used to indicate the size of the data (block, word, byte, or other) for memory 
read cycles. 

CPC1 

OUT 

Connection between the R3000 Processor and the R301 0 FPA, indicating the statusof the conditional 
branch. This pin is provided for diagnostic purposes, only. 

CPC2, CPC3 

I/O 

Direct connections to R3000 pins. 

EXC# 

OUT 

Direct output from R3000, indicating the EXC# signal between the R3000 and the R3010. 

BERR# 

IN 

Negative-true input to R3000, used to indicate a bus error in main memory. 

FPA# 

OUT 

Negative-true output for R3010, indicating the presence of R3010 FPA on the module. 

FPINT# 

OUT 

Negative-true R301 0 interrupt request. Connected on-board to INTI # or INT3# ( jumper selectable). 

RCOND(0:3) 

IN 

Reset condition lines. Used to select a reset vectors set from a given choices. 

WPERR 

OUT 

Positive-true output used to indicate data parity error while write to memory operation. 

RPERR 

OUT 

Positive-true output used to indicate data parity error while read from memory operation. 

RDEN# 

IN 

Negative-true input used to latch data from the main memory bus in the read/write buffers. 

WRCTL# 

IN 

Negative-true input used to latch data from the R3000 CPU in the read/write buffers. 

WOE# 

IN 

Negative-true input used to write data from the read/write buffers to the main memory. 

AEN# 

IN 

Negative-true input used to latch the R3000 address in the address buffers. 

AOE# 

IN 

Negative-true input used to enable the output of the R3000 address from the address buffers to the 
main memory bus. 

EXAEN# 

IN 

Negative-true input used to latch an external address in the address buffers. 

EXAOE# 

IN 

Negative-true input used to enable the output of the external address from the address buffers to the 
R3000/cache bus. 


7.8 
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R3000 RISC CPU MODULE 


PRODUCT BRIEF 


RELATED PRODUCTS 
Prototyping System 

The 7RS110 module can be placed into immediate ser- 
vice using our flexible 7RS310 Prototyping Platform. The 
system includes two boards: a general purpose CPU board, 
and a personality card that interfaces the module to the CPU 
board. 

The CPU board contains 1 Mb of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, 
and an 8-bit parallel port accessible through a dual port 
RAM. Four 50-pin connectors provide access to all the 
address, data, and control signals for external connection to 
additional hardware on, for example, a wire-wrap board. 


The system includes IDT’s IDT/sim System Integration 
Manager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and con- 
nectors for an HP 16500 Logic Analyzer. 

Code for the R3000 can be created on a MIPS develop- 
ment system, on IDT’s MacStation™ system, or using IDT’s 
Multi-Host cross assembler and compilerproducts, available 
for a variety of machines. Assembled code can be down- 
loaded into the Prototyping System for execution and debug. 



Module Prototyping Platform. 

The card on the left is the personality card with a module; the card on the right is the general purpose CPU. 
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ORDERING INFORMATION 


Ordering Part Number 

CPU 

FPA 

l-cache 

D-cache 

Speed 

Other 

7RS1 10N55A16A 

R3000A 

R3010/A 

32K 

32K 

16 MHz 


7RS1 10N55A20A 

R3000A 

R3010/A 

32K 

32K 

20 MHz 


7RS1 10N55A25A 

R3000A 

R3010/A 

32K 

32K 

25 MHz 


7RS110N55A33A 

R3000A 

R3010/A 

32K 

32K 

33 MHz 


7RS1 10F55A16A 

R3000A 

R3010/A 

32K 

32K 

16 MHz 


7RS110F55A20A 

R3000A 

R3010/A 

32K 

32K 

20 MHz 


7RS1 10F55A25A 

R3000A 

R3010/A 

32K 

32K 

25 MHz 


7RS1 10F55A33A 

R3000A 

R3010/A 

32K 

32K 

33 MHz 


7RS1 10N54A16A 

R3001 

R3010/A 

32K 

16K 

16 MHz 


7RS1 10N54A20A 

R3001 

R3010/A 

32K 

16K 

20 MHz 


7RS1 10N54A25A 

R3001 

R3010/A 

32K 

16K 

25 MHz 


7RS1 10N54A33A 

R3001 

R3010/A 

32K 

16K 

33 MHz 


7RS1 10F54A16A 

R3001 

R3010/A 

32K 

16K 

16 MHz 


7RS1 10F54A20A 

R3001 

R301 0/A 

32K 

16K 

20 MHz 


7RS1 10F54A25A 

R3001 

R3010/A 

32K 

16K 

25 MHz 



ADDITIONAL INFORMATION 

For more details on the 7RS110 module, refer to the 7RS110 Technical Specification and User's Manual. 
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RISC DEVELOPMENT SUPPORT 


The R3000 RISC processors are blessed with the most 
powerful software development tools available for any ma- 
chine. The MIPS workstation and server products make it 
possible to have many users on a well-controlled UNIX 
software development, executing and debugging code on 
native systems. IDT, as a MIPS partner, is an authorized 
reseller of these systems and the software support products 
that go with them. 

Additionally, IDT has developed its own hardware and 
software development products which are more closely tuned 
with the needs of the small system or embedded control 
system designer. These include a low-cost, single-user 
R3000 workstation built around a Macintosh computer (the 
"MacStation"), and a variety of hardware prototyping tools. 

For compiling C language programs, there is no better 
compiler anywhere than the MIPS C-compiler. The compiler 


and the CPU architecture were designed togetherto produce 
the highest performance possible running compiled C pro- 
grams. Other languages, such as ADA and FORTRAN, are 
also supported on the MIPS systems. IDT's MacStation can 
be used to run the MIPS C-compiler as well. 

IDT has independently developed a multi-platform cross C- 
compiler for the R3000 architecture. It is currently hosted on 
seven different platforms, including the ubiquitous 286 PC 
running MS-DOS. Although slightly less efficient than the 
M IPS C-compiler, it offers the advantages of readily available, 
inexpensive hosts and an integrated floating point library. 

As with any other widely used processor, many support 
products have been developed by independent companies. 
These include ICE systems, Logic Analyzers and more. A 
partial list of third party suppliers is included at the end of 
this section. 
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FEATURES: 

• Based on Very High Performance MIPS® RISC 
Technology 

• Binary compatibility across the complete line of MIPS 
RISComputers™, RISCstations™ and the IDT 
MacStation™ family 

• Native Development Environment for MIPS ISA CPUs 

• RISC/os™ MIPS SVID compliant UNIX® operating 
system with System V and BSD converged 

• World class optimizing compilers including C, Pascal, 
ADA, PL/1 , FORTRAN, COBOL 

• Networking standards such as TCP/IP and NFS™ for 
connections to SUN, DEC, and other equipment 

• X Terminal support via RISCwindows™ 

• Source level symbolic debugging 

• Supports a wide range of development tools 

— System Programmer's Package 

— IDT floating point library 

— IDT System Integration Manager 

— Cache-2000 and Cache-3051 TargetSystem™ 
modeling tools for performance projections 

• MIPS host platforms include: 

— RC 3230/Magnum 3000 desktop workstation family 

— RC 3240 deskside server family 

— RC 3260 pedestal departmental server 

DESCRIPTION: 

This data sheet contains an overview of some of the tools 
and support available when developing microprocessor based 
systems using the MIPS development host platforms. These 
systems provide an easy-to-use, robust environment for the 
hardware and software development of today’s high-perfor- 
mance RISC based systems, and greatly reduce the time and 
effort required to bring an application to market. 

As in any development environment, ongoing enhance- 
ments are made to enrich the tool set. Detailed information 
on current products and enhancements, new tools, and third- 
party support products is available from your local IDT sales 
representative. 

MIPS DEVELOPMENT SYSTEMS 
CHARACTERISTICS 

MIPS' Development Systems, available from IDT, provide 
a large tool box when developing software and designing 
hardware architectures for any of the MIPS ISA CPUs. Based 
on the UNIX® operating system, RISC/os™ is SVI D compliant, 
converging BSD and System V. Because RISC/os™ is a 
multiuser, multi-tasking operating system, many users can be 


active on the host at the same time and each user may initiate 
multiple tasks. These include simple multi-tasking capabilities 
from print spooling— the ability to queue listings to print while 
still compiling or editing — to more than 200 users all compil- 
ing, editing, linking, etc. 

User protection indigenous to UNIX®, easily managed by 
the System Administrator, is a huge asset in developing, 
testing and maintaining systems software. Configuration 
Management, the job of software management, is much 
easier with these user isolation capabilities. For example, 
each user can belong to a group which is developing a certain 
section of the project. As a user builds and tests a subpro- 
gram, he submits that tested unit to Configuration Manage- 
ment for archive. The librarian can give other users access, 
but they cannot modify the code. This enforces coding rules 
as code is reviewed before it is accepted. The style may seem 
restrictive, but has many benefits: it ensures reliable code in 
the library, enforces testing procedures, and keeps code 
stable. 

Many features of standard UNIX® operating systems ease 
the burden of program development. VI, the screen editor, 
(instead of a line editor), is a simple and powerful text editor 
that can be used for program entry. It has all the standard text 
editor features, plus an array of advanced capabilities for the 
experienced programmer. Saving and backing up important 
files is easily achieved by use of the TAR command. TAR, 
used for streaming tape backup, lists directories on tapes, can 
extract files from tapes, compare them, and manipulate them 
easily. Lastly, the Makefile command helps compile and link 
many subprograms easily. It is a simple list of all the files you 
need to compile and then link together. If during the debug 
process one subprogram is modified, invoking Makefile will 
find the single subprogram that has to be compiled again (by 
time/date stamps) and then relink automatically. 

The CPU bandwidth required to be efficient at doing 
multitasking is well beyond the capability of today's medium 
range PC or Macintosh™ computers. Many advances in 
personal computer technology have stretched these single 
user architectures into multiuser domains, but to reduce the 
latency when adapting these systems requires more raw 
MIPS. These MIPS development hosts are thus ideal forthe 
medium to large scale development team required for today's 
high-performance RISC-based applications. 



CONNECTIVITY 

The MIPS Development Platforms have been designed to 
easily integrate into existing computer environments. Inmost 
cases, these platforms easily integrate and allow the users to 
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access the system through their existing terminals, 
createsource files with existing terminals for download, com- 
pile and link, or more extensively use the systems file and 
printer resources. Solutions to a wide variety of integration 
goals and computing environments are readily available as 
parts of the MIPS Development platforms. 

RS-232 Connections 

The simplest method to interface to the MIPS Development 
Platform is to attach RS232 terminals directly. There are 
typically a numberof direct connections, including Comm 0 for 
the system console (where the system is booted and system 
administration functions are often performed) . These terminal 
ports allow a number of different RS-232 devices to be 
connected to the host system, including IBM compatible PCs 
running terminal emulation software or modem connections 
fortelephone dialup. BAUD rates, parity, and othercommuni- 
cation protocol information are easily configured. 

Network Connections and X-Terminals 

MIPS development systems are capable of integrating into 
all common computer networks, including Ethernet and 
DECnet. If the current environment includes thick or thin wire 
Ethernet the connection is simple, since MIPS systems sup- 
port NFS-Network File System. NFS™ handles file transfers 
across a network, making all of the file resources of the 
network appear as if they belong in each system on the 
network. Since TCP/IP runs on the system, Ethernet file 
transfer and file sharing is transparent. This allows MIPS' 
systems to easily connect to DECstations, Sun workstations, 
and any other Ethernet system using these conventions. 
Using Ethernet, X Terminal support is provided by 
RISCwindows™ . This is a MIPS native implementation of the 
standard X Window System™ combined with OSF Motif™. 
Alternately, the RISComm software package can be used to 
integrate the MIPS host into the DECnet computer network. 

PC Compatible Connections 

IBM PCs or compatibles can be connected directly through 
RS232 or Ethernet. For RS232 connections, a simple terminal 
emulator like Procomm Plus works well. The programmer can 
create source code using DOS applications and then transfer 
it to the MIPS platform using the Kermit protocol inside of 
Procomm Plus. Alternatively, the programmer can share files 
in the PC and the system by using products from Novell. They 
provide a software tool that runs in both systems allowing the 
user to compile files that exist on the PC, or run application 
programs on the PC and use the file system on the MIPS' 
platform transparently. 

Macintosh 

Several solutions are available to utilize existing Macintosh 
equipment. uShare™ provides a communication network 
between the Apple Macintosh and MIPS platforms. Once in 
place, it provides Mac users with server capabilities, elec- 
tronic mail, print spooling, data backup, and terminal emula- 
tion allowing interactive work. This environment allows pro- 
grammers to create code on the Mac using standard word 
processing applications, then transfer the file to the MIPS 


platform for compile and test. Caymen makes The Gator Box, 
which provides a gateway or bridge from Apple-Talk to the 
MIPS platform. Disk sharing and file sharing are capable with 
this tool as well. 

IBM Mainframes 

SNA 3270 allows RISC/os™ users to connect to IBM and 
others using SNA® protocol. Using the SNA 3270 software, 
the MIPS platform looks like an IBM terminal using any 
standard UNIX® terminal, eliminating the need for actual IBM 
terminals and printers. Support for standard IBM file transfer 
protocol for downloading and print spooling is possible with 
the session manager. 

MIPS OPTIMIZING COMPILERS 

A very important aspect of performance and development 
ease is the compiler technology. Fortunately, this is a funda- 
mental strength of MIPS. 

The MIPS instruction set and compiler technology were co- 
developed before any silicon was architected. Man years 
were expended developing, tuning, and testing the instruction 
set and the ability of compiler technology to generate efficient 
object code from high-level languages. After this rigorous 
analysis, the silicon tradeoffs were made. The result is a chip 
and compiler toolset designed to work with each other; this is 
dramatically different from traditional CPU/compiler develop- 
ment, where a chip manufacturer designs a chip and then 
expects compiler writers to "do their best with it". 

MIPS optimizing compiler technology, widely regarded as 
the best in the industry, is based on mathematical models that 
work 100% of the time. This is substantially better than the 
"portable compiler" technology used for many other proces- 
sors, which frequently uses heuristic rulesforoptimization that 
may not result in as robust operation. Today, five levels of 
compiler optimization are available. Each individual level 
further optimizes code size or execution speed. Local opti- 
mizations include: Optimal Calling Sequences, Branch to 
Branch Optimizations, Local Common Subexpression, 
Schedule FP Units, Pipeline Scheduling, Constant Folding, 
and Code Selection. Global optimizations, which optimize 
across procedure boundaries include: Moving invariant code 
out of loops, Strength Reduction, Register Assignment, Glo- 
bal Common Subexpression, Shrinkwrapping, Inter-procedural 
Register Allocation, and Procedure Inlining. 

The structure of the MIPS compilers exemplifies "State of 
the Art" design; the compiler suite is architected to use a 
common optimizing structure, called the back end, and mul- 
tiple language parsers or front ends. This foundation yields 
many benefits: improvement in the back end improves all 
languages, and more front ends (languages) can be easily 
added at any time and obtain the full benefit of MIPS optimi- 
zations. Also, the structure allows combinations of languages, 
including assembly, to be incorporated in a single program. 
MIPS compiler suite includes C, FORTRAN, Pascal, ADA, 
COBOL, PL/1, assembly, and plans to add C++ and ANSI C 
soon. Languages from third party vendors are available as 
well. 
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TOOLS FOR DEVELOPMENT 

MIPS development hosts include support for both hard- 
ware planning, software development, and software integra- 
tion. These tools allow application architects to define a 
system capable of meeting the cost/performance goals of the 
application, allow the software to be developed quickly and 
efficiently, and ease the process of integration of the applica- 
tion software onto the final target system. 

Hardware Architecture Evaluation 

The quality of an application system is determined by its 
ability to meet the needs of its marketplace. Good systems are 
well planned, well thought out, well architected, and are 
maintainable. Tools for hardware architecture definition help 
measure how close the final system will actually come to 
meeting the systems’ predefined goal, before time and money 
are spent on developing prototypes. MIPS tools include the 
ability to accurately guage the performance of application 
software on a given memory architecture, and allow the 
hardware architect to make cost performance tradeoffs in the 
target system even before schematics are fully developed. 
These tools are contained in a product called the System 
Programmer's Package-SPP, and complement many of the 
software development tools discussed below. For example, 
Sable, a symbolic debugger/instruction simulatorcan be used 
to profile and develop kernel code. Pixie, Pixstats, and Profiler 
work to provide detailed information about the dynamic be- 
havior of the software in the proposed system including 
information about cache performance, "hot spots” and dead 
spots in the code, and the dynamic instruction mix used by that 
software. 

DBX Source-Level Debugger 

This standard debugger for UNIX® is excellent for testing 
and finding software problems at the source code level. The 
debugger works for C, FORTRAN, Pascal, and assembly 
language. DBX is an easy tool to use; simply compile your 
code with the -g option (this keeps the symbol table from being 
stripped) and then invoke the debugger by typing “DBX 
command_file_name source_file_name". The 
command_file_name allows a set of DBX commands to run 
for set up, like setting variables, setting breakpoints, running 
to a specific point and displaying variables. Simple commands 
for DBX are; RUN, LIST, DUMP (list data about current 
procedure), STEP, STOP AT line#, STOP IN procedure_name, 
UP/DOWN (traverse stack activation levels), etc. Complex 
commands like “stop VAR in PROCEDURE if EXPRESSION” 
are also supported. DBX lets you use all the resources within 
the language except coprocessorzero, which can be debugged 
using "Sable”. 

Both the SPP and the IDT System Integration Manager 
allow the capabilities of DBX to be brought to source level 
debugging on a remote target system. This provides a familiar 
environment to the programmer as the task of integrating 
application software onto the target system is performed. 

Program Behavior Analysis Tools 

Pixie is a UNIX® program that provides statistics about a 
program’s dynamic behavior. One type of data collected 


shows all the called routines, how many times they were 
called, how many instructions per call, and the percent of time 
spent in that call. This information is used to find out where 
most of the program time is spent, allowing the programmerto 
focus on those routines which dramatically affect the perfor- 
mance of the application. Another type of data collected 
shows the locality of the code. This helps to predetermine how 
much cache would be appropriate for this program. It also 
goes into detail on floating point usage, integer multiply/divide 
usage, all possible interlocks, load/branch nops, and more. 
For it to collect this type of data, it actually instruments your 
object code; the instrumented code is executed normally, and 
the output is formatted using the pixstats (pixie statistics) 
program. 

The Profiler is another UNIX® tool that further defines code 
behavior. The user can concentrate in specific areas of 
program behavior by focusing the profilers attention to those 
items. Separate lists can be generated covering procedure 
information. They are: sorted by total time spent in each 
procedure, sorted by times called, sorted by number of cycles 
executed, sorted by number of clocks executed, sorted by 
number of clocks per line inside a procedure, and number of 
procedures never called. Again, this tool set allows the 
programmer to tune those areas of the code which will most 
affect the end system performance, thus increasing program- 
mer efficiency. 

The System Programmer's Package 

SPP is a toolbox containing programs that help system 
developers to build, test, and download software to the target 
hardware. Among the utilities Included in the SPP are the 
following: 

• Cache2000; a hardware environment simulator which 
allows the system architect to evaluate the performance 
of different types of memory systems 

• Sable; a software environment simulator which allows 
symbolic kernel code debug 

• DBGMON; a set of tools for downloading into the target 
machine, and interfacing to DBX to perform remote target 
source level debugging. 

• common I/O routines and drivers used in many sys- 
tems. 

SPP is provided in source form and is written in C, allowing 
for easy modification and tailoring. SPP is covered in detail in 
its own data sheet. 

SUMMARY 

MIPS Development platforms provide all the tools neces- 
sary for hardware and software system design. UNIX comple- 
ments the tool set by providing an engineering environment 
and communication capabilities to support a large or small 
engineering team. This environment supports the full devel- 
opment cycle required to be successful with and quick to 
market with applications based on the IDT/MIPS RISC pro- 
cessor family. 

ADDITIONAL INFORMATION 

Additional information about specific models and capabili- 
ties is available from your local IDT sales representative. 
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FEATURES: 

• Use with any module to form a complete R3000 
based computer system 

• Includes 1024K bytes of static RAM main 
memory, and 256K bytes of EPROM 

• IDT's System Integration Manager In EPROM. 
Supports downloading software, single step, 
and other debugging needs 

• Two RS232C serial ports (68681), parallel port 
(IDT7134 dual-port RAM), and programmable 
interval timer (8254) 

• Address bus, data bus, and all necessary con- 
trol signals provided on connectors (four 50-pin 
IDCs) to permit user expansion 

• Direct connections to HP 16500 Logic Analyzer 

• Fastest way to get an R3000 system up and 
running 


DESCRIPTION: 

The IDT7RS300 series Prototyping Platforms consist of 
two separate PC boards which connect together by means 
of convenient ribbon cables to form the total system. 

The System Board contains the non-CPU system func- 
tional units: static RAM main memory, EPROM monitor, two 
serial ports, a parallel port (utilizing an IDT7134 Dual-Port 
RAM), a free-running programmable timer, and expansion 
connectors to permit additional hardware to be added. It is 
designed to be placed on a flat table-top surface. Standoffs 
are provided for physical support. Access to all components 
on the board is readily available with this packaging approach. 
The Personality Board contains control logic and a socket 
for the a particular RISC module. The Personality Board is 
also designed for table-top use. 

The EPROM on the System Board contains IDT's System 
Integration Manager (IDT/sim), a powerful tool for down- 
loading software and debugging both hardware and software. 
Drivers can easily be added to support other I/O devices or 
changes in I/O addresses. 

Software can be downloaded into the board from a MIPS 
machine, from an IDT MacStation development system, or 
from a PC running IDT's Cross Development Software. 



The 302 version of the Prototyping Platform. 

The System Board Is on the right; the personality card and 102 module on the left. 
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FLEXIBLE PROTOTYPING PLATFORMS 
System Description 

The 7RS300 series RISC Prototyping Platform is de- 
signed to simplify the initial prototyping of both hardware 
and software for systems using one the the IDT RISC 
Subsystem Modules. The System Board is very general, 
and is the same in all of the 300 series Platforms. It contains 
basic control logic, mostly in PALs, 1 megabyte of static 
main memory, 256K of EPROM, a counter/timer, and I/O 
ports. Static RAM is used for main memory to provide the 
simplest interface to the module. The EPROM contains 
IDT's System Integration Manager in about 80K; the rest is 
available for user software. 

The System Board connects to a personality board for the 
module through a pair of ribbon connectors. Each module 
architecture uses a different personality board. The per- 
sonality board provides such features as clock generation, 
R3000 reset and initialization, read and write buffers, etc., to 
the extent that they are not already on the module. The 
personality board also contains five 20-pin plugs that can be 
directly connected to an HP 16500 series Logic Analyzer, 
and provides a uniform interface to the System Board. 

System Board Hardware 

The System Board is powered by a single 5 volt supply 
connected to a plug on the board. The plug conforms to the 
standard used for PCs, so an ordinary inexpensive PC 
power supply works easily with the board. A terminal can be 
connected to one of the RS-232 ports to act as the terminal 
for the Software Integration Manager. The other serial port 
is generally used to download software from some host 
system. Alternatively, there is an 8-bit wide parallel port bu ilt 
using dual port RAM that can be used for higher speed 
download. 


Four 50-pin IDC (3M) connectors are configured for 
connecting additional hardware to the System Board. They 
contain the following signals: 

• 32 bits of address 

• 32 bits of data, and 4 parity bits 

• SYSOUT (buffered clock from the R3000) 

• RESET# (copy of the R3000's Reset signal) 

• Parity and Address output enables from the address 
and data registers (to permit tri-stating other data onto 
these lines). 

• Six interrupt lines to the R3000. These are registered 
or not, depending on the module. 

• The four byte Write Enable signals. 

• Five decoded chip select outputs from the upper 16 
bits of address (1 FE6 through 1 FEE). 

• MEMRD#, used to enable output devices in the ex- 
pansion system during data read cycles. 

• Auxiliary input and output signals from the 68681 dual 
UART 

• MREQ# and XACK# handshaking signals for control- 
ling the timing of data transfers. 

Personality Board Hardware 

The personality board connects to the system board 
through two ribbon connectors. It contains a cut out area 
and plugs which accept the appropriate module. There are 
two five-volt power connectors, again using standard PC 
plugs. One power supply is for the personality card, the 
other for the module. 

Five connectors are pre-wired to connect the modules 
signals to an HP logic analyzer. Because of the speed of the 
signals in the R3000 system, the connectors are placed on 
the slow side of the read/write buffers, so for disassembly 
and trace purposes, the R3000 must be run uncached. 

Software Included 

The System Board contains IDT's System Integration 
Manager (IDT/sim) in EPROM. 
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FUNCTIONAL BLOCK DIAGRAM 



Block Diagram of the 7RS300 Series Prototyping Platforms 
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Layout of the series 300 Prototyping Platform. 
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ORDERING INFORMATION 


Ordering Part Number 

Speed 

FOR USE WITH THE 7RS101 ARCHITECTURE 

7RS301-16 

16 MHz 

7RS301-20 

20 MHz 

7RS301-25 

25 MHz 

7RS301-30 

30 MHz 

FOR USE WITH THE 7RS102 ARCHITECTURE 

7RS302-1 6 

16 MHz 

7RS302-20 

20 MHz 

7RS302-25 

25 MHz 

FOR USE WITH THE 7RS103 ARCHITECTURE 

7RS303-16 

16 MHz 

7RS303-20 

20 MHz 

7RS303-25 

25 MHz 

FOR USE WITH THE 7RS104 

Contact Factory 


FOR USE WITH THE 7RS107 ARCHITECTURE 

7RS307-1 6 

16 MHz. 

7RS307-20 

20 MHz 

7RS307-25 

25 MHz 

7RS307-33 

33 MHz 

FOR USE WITH THE 7RS108 ARCHITECTURE 

7RS308-25 

25 MHz 

FOR USE WITH THE 7RS109 ARCHITECTURE 

7RS309-25 

25 MHz 

7RS309-33 

33 MHz 

FOR USE WITH THE 7RS1 10 ARCHITECTURE 

7RS310-20 

20 MHz 

7RS310-25 

25 MHz 

7RS31 0-33 

33 MHz 


INCLUDED WITH SYSTEMS 

Each Prototyping Platform includes the System Board, 
completely populated with 1 Mb of RAM and 256K of 
EPROM, with the Software Integration Manager in the 
EPROM. Each System also includes the appropriate per- 
sonality card for the module architecture indicated and 
configured forthe speed indicated. Documentation includes 
complete schematics for both the system board and the 
personality board, including all the PAL equations for the 
control circuitry. 


Auxiliary Download Program 

For downloading code from a MIPS machine into an 
evaluation board. This software includes programs 
to convert MIPS object code into S-records and to 
download either ASCII or binary S-records to a 
remote target. This software is only needed with 
MIPS computers; all other machines (including the 
MacStation) have standard utilities available to 
perform this function. 

MIPS download utility 7RS950BUU 

Supplied on QIC-24 TAR tape. 
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FEATURES: 

• Simple and direct connection to HP 16500A 
Logic Analyzer System 

• Probe points to 32 address, 32 data and 16 
control signals of R3000 

• Several clocks available for signal strobes 

• Compact physical size permits its use in target 
system with minimal impact on spacing 
requirements 

• Setup files for 16500A Logic Analyzer assures 
speedy startup 

• No active components 

• Compact design assures minimal added lead 
capacitances (approx 5 pF) 


DESCRIPTION: 

The IDT7RS363 is an adapter card intended for use in 
performing diagnostics on the operation of the IDT79R3000 
RISC Processor on a Hewlett-Packard model 1 6500A Logic 
Analysis system. It contains no active components. In- 
stead, it is used as a socket adapter for the R3000, and all 
address, all data, and many control lines are made acces- 
sible for capture by the logic analyzer. It may only be used 
with the pin grid array (PGA) package of the R3000 and 
requires the logic analyzer system to be equipped with 5 HP 
Termination Adapters, P/N 01650-63201, to provide for the 
direct connection to the analyzer input pods. 

For ease of setup, a diskette is provided, as a part of the 
7RS363, which contains files loadable directly into the HP 
logic analyzer. These files automatically set up the logic 
analyzer by assigning the pods and the individual input 
channels directly to the signals captured from the R3000. In 
this way, the logic analyzer display will immediately repre- 
sent the captured signals with all the proper signal names 
displayed. 

The 7RS363 may also be used as a simple diagnostic 
tool, separate from its use as a logic analyzer adapter card. 
This is accomplished by virtue of the fact that all necessary 
signals of the R3000 are immediately accessible as test 
points on the card. 


IDT7RS363 


R3000 PGA ADAPTOR 


Integrated Device Technology, Inc. 









IDT7RS363 

R3000 PGA ADAPTOR 


PRODUCT BRIEF 



8.3 


2 





£ 

fPN® 

■ 

R3000 DISASSEMBLER IDT7RS364 

ll 



FOR USE WITH THE HP 16500 



LOGIC ANALYZER 

Integrated Device Technology, Inc. 

1 



FEATURES: 

• Disassembler for the IDT79R3000 RISC micro- 
processor on the HP16500 logic analyzer 

• Interfaces to the IDT7RS301 And IDT7RS302 
TargetSystem Boards 

• Allows actual code tracing 

• Direct Connection to IDT 7RS300 series Proto- 
typing Platforms 

• Can Be Used with Any R3000 System 


DESCRIPTION: 

The IDT7RS364 Disassembler for the HP16500 logic 
analyzer is a useful tool meant to ease the task of debugging 
software run on the IDT7RS301/302 TargetSystem Boards. 
The HP16500 itself allows the engineer to capture the 
CPU's executed opcodes. However, the IDT7RS364 will 
allow the engineer to trace the actual code executed by the 
processor without having to resort to lookup tables. The 
IDT7RS364 will decode the opcodes captured by the logic 
analyzer into the correct instruction mnemonic so that the 
engineer will know the exact processor state. 

The IDT7RS364 will only capture main memory accesses 
(see Figure 1). Therefore its operation is guaranteed only if 
the software is being run in uncached address space. 
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1FC04AD0 

34 

1FC04AD4 

35 

0001F96C 

36 

1FC04AD8 

37 

1FC04B20 

38 

1FC04B24 

39 

1FC04B28 

40 

1FC04B2C 

41 

1FC04B30 

42 

1FC04B34 

43 

0001F96C 

44 

1FC04B38 

45 

0001F970 

46 

1FC04B3C 

47 

000018DC 

48 
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R3000 MNEMONIC | 

Hex 


LW si, 0x0034 (sp) 

B 0xlFC04B20 

LOAD DATA 0x00006602 

NOP 

LI at, 0x00006601 
BEQ si, at, 0xlFC04ADC 
NOP 

MOVE aO, sO 
LW al, 0x0034 (sp) 

LW a2, 0x0038 (sp) 

LOAD DATA 0x00006602 
LW tl, 0x0020 (sO) 

LOAD DATA 0x00000000 
NOP 

LOAD DATA 0xBFC162D0 
LW t2, 0x0014 (tl) 
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Typical Screen Display 
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The 7RS301 Prototyping Platform for the 7RS101 Modules. 

The five connectors for the logic analyzer are on the module personality card. 


ORDERING INFORMATION 

The Disassembler is shipped on a 3.5" diskette for the HP16500 series Logic Analyzers. Included is complete set-up 
documentation and drawings showing how to use it with systems other than the 7RS300 series Prototyping Platforms. 

Disassembler 7RS364 
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R3000 AND R3001 

IDT7RS382 
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Integrated Device Technology, Inc. 

1 

EVALUATION BOARDS 

IDT7RS383 


FEATURES: 

• Complete RISC Example System 

• Available for R3000 or R3001 CPUs 

• Supplied with complete schematics and PAL 
equations 

• Includes R3010 Floating Point Accelerator, Data 
and Instruction Caches (16k bytes each), 128Kb 
of main memory, and 128Kb of EPROM 

• IDT's System Integration Manager included in 
EPROM 

• Connectors provided for easy expansion 

• CPU and FPA in PGA Sockets for easy connec- 
tion to Logic Analyzers 


DESCRIPTION: 

The IDT7RS382 and 7RS3833 are complete RISC sys- 
tems intended for use as low-cost evaluation systems They 
are completely self-contained on a single printed circuit 
board and requires only a simple CRT terminal and a 5 volt 
power supply for operation. 

The board containsthe RISCCPU (R3000onthe7RS382 
and R3001 on the 7RS383) and a IDT79R3010 Floating 
Point Accelerator (FPA), as well as 1 6 Kbytes of Cache RAM 
for each of the data and instruction caches, EPROM, static 
main memory, two serial ports, a free-running program- 
mable timer, and expansion connectors to permit external 
systems to be added. In addition, the system also contains 
clock timing, reset initialization logic, and read/write buffer 
control logic. 

The EPROM contains IDT's powerful System Integration 
Manager (IDT/sim), a debugging monitor that supports 
download of code from host systems, execution control 
commands, memory probing, and I/O. 

The board is 1 1 inches long and 7.5 inches wide and is 
designed to be placed on a flat table-top surface. Standoffs 
are provided for physical support. Access to all components 
on the board is readily available with this packaging 
approach. 










IDT7RS382, IDT7RS383 
RISC EVALUATION BOARDS 


PRODUCT BRIEF 


COMPLETE SINGLE BOARD COMPUTERS 

The 7RS382 and 7RS383 Evaluation Boards are com- 
plete single board RISC computers, requiring only a 5 volt 
power supply and an RS-232 terminal for operation. 

Both systems are intended as examples of typical de- 
signs using the R3000 and R3001 CPU chips. All the 
schematics and details of the designs are supplied with the 
boards. 

The boards can also serve as a prototyping platform. 
Connectors are provided to add additional hardware on a 
wire wrap board. A 20-pin plug, J5 on the drawings, 
provides 1 6 control signals that can be connected to a logic 


analyzer. Both the CPU and the FPA chips are mounted in 
PGA sockets, so a PGA adaptor (7RS363) can be used to 
connect all the CPU pins to a logic analyzer. 

The 7RS383 (but not the 7RS382) can accept larger RAM 
and EPROM devices, supporting up to 512 Kb of EPROM 
and 2 Mb of static RAM. 

Software supplied in the EPROM permits downloading of 
R3000 code compiled on a PC, MacStation, or MIPS ma- 
chine. Commands are available to set break-points, single 
step, examine and modify memory, etc. 


SIGNALS SUPPLIED ON EXPANSION CONNECTOR 


Signal Name 

1 or 0 

Description 

EA00-EA31 

I/O 

32-bit buffered address bus 

ED00-ED31 

I/O 

32-bit buffered data bus 

SYSOUT 

o 

Buffered SYSOUT Clock from CPU; used to synchronize data transfers 

MRES# 

o 

Copy of the Reset signal to the CPU 

MREQ 

o 

Memory Request output (handshaking signal for data transfers) 

XACK# 

1 

Acknowledge input (handshaking for data transfers) 

OP3-OP5 

o 

Auxiliary output pins from the 68681 UART 

IP4-IP5 

1 

Auxiliary input pin to the 68681 UART 

WEA-WED 

o 

Write Enables for the four bytes of the data word 

UCS1A-UCS1D 

o 

Four chip select signals decoded from the high order address bits. 

INTO 

1 

Interrupt input to the R3000 

PMRD# 

o 

Memory Read output signal. Used to enable output drivers in the expansion system during data 
read operations 






































IDT7RS382, IDT7RS383 
RISC EVALUATION BOARDS 


PRODUCT BRIEF 


FUNCTIONAL BLOCK DIAGRAM 


CTRL ADDRLO TAG+ACCTYP 


INSTRUCTION-CACHE 
3-71586 
(16K bytes) 



The 7RS383 (with R3001) does not require parity; 
the 7RS382 (with R3000) includes parity generation logic on the board. 
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IDT7RS382, IDT7RS383 

RISC EVALUATION BOARDS PRODUCT BRIEF 


PHYSICAL LAYOUT 


DB25 SERIAL PORTS 
CRT | 

TERMINAL AUXILIARY 1 
CONNECTOR CONNECTOR 



















1DT7RS382, IDT7RS383 
RISC EVALUATION BOARDS 


PRODUCT BRIEF 


ORDERING INFORMATION 

Each unit is shipped with complete schematics and PAL equations. A user's manual includes instructions on 
downloading code, operating the Software Integration Manager, and providing the correct timing interfaces to additional 
hardware. 


Evaluation Boards 

R3000 Based Evaluation Board 7RS382 

R3001 Based Evaluation Board 7RS383 


EPROM Upgrades 

The following part numbers are for updating the evaluation board hardware to the latest version of the 
monitor. 

For Evaluation boards and Prototyping Systems 7RS901BAP 

Use with 7RS382.383, or any 7RS300 series module Prototyping Platform 

Auxiliary Download Programs 

For downloading code from a MIPS machine into an evaluation board. This software includes pro- 
grams to convert MIPS object code into S-records and to download either ASCII or binary S-records 
to a remote target. This software is only needed with MIPS computers; all other machines (including 
the MacStation) have standard utilities available to perform this function. 

MIPS download utility 7RS950BUU 

Supplied on QIC-24 TAR tape. 
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REALS*™ IDT7RS388 

R3000 LASER PRINTER CONTROLLER 
EVALUATION SYSTEM 


FEATURES: 

• Plugs Directly into Canon LBP-SX 8-ppm print 
engine 

• Provides approximately 8x performance of 
LaserWriter II NTX on typical benchmarks 

• 25 MHz R3001 RISController™ CPU with R3010 
Floating Point Accelerator and 16K each of 
Instruction and Data Cache 

• Includes 4 MB of DRAM, socketed 

• Supports up to 4 MB of EPROM, socketed 

• Software Included on EPROM 
Truelmage™ Page Description Language 
PeerlessPage™ Printer Operating System 
IDT/sim Debug Monitor 

• Schematics and Hardware Reference Manual 
included 


DESCRIPTION: 

IDT's 7RS388 REAL8 Laser Printer Controller provides a 
board level, hands-on, design demonstration and 
benchmarking tool for IDT R3000 based, PostScript™, 
Truelmage, or PCL5 class Raster Image Processing (RIP). 

The REAL8 uses a 25 MHz R3001 , with 1 6KB Instruction 
and 16KB Data Caches to rasterize pages at performance 
points from four to twelve times 68020 engine-based con- 
trollers. The board is supplied with Truelmage PDL and 
fonts fordirect speed comparisonswithothercontrollers. All 
the design documentation is supplied, and the design is 
easy to tailor to a variety of cost/performance levels by 
adding additional I/O or deleteing RAM or EPROM. A 
demonstation copy of PeerlessPage Printer Operating Sys- 
tem is also included. PeerlessPage can support a variety of 
I/O drivers and Page Description Languages, including 
multiple PDLs in the same controller. 

The REAL8 controller fits into the industry standard 
Canon LBP-SX laser print engine and provides standard 
LBP-SX “video” output to the engine. 



The REAL8 Controller Board 


RSD PB388/B/3 
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IDT7RS388 REAL8™ 

LASER PRINTER CONTROLLER DEVELOPER’S KIT 


PRODUCT BRIEF 


PRODUCT OVERVIEW 

The IDT7RS388 REALS Printer Controller is an evalua- 
tion vehicle allowing designers to run benchmarks to compare 
the performance of an R3000 based controller with those 
built around other processors. The product includes com- 
plete design documentation, so it provides a design model 
which can be enhanced with other I/O ports, emulations, 
cartridge connectors, front panel controls or other differen- 
tiating features. The PeerlessPage Printer Operating System 
included with the board makes it easy to interface new I/O 
drivers and new printer emulations. 


The REAL8 board runs at 25 MHz, but the same design 
could be used, with appropriate component selections, over 
a range of 16 to at least 33 MHz. 

The ROM and DRAM memory areas provide sockets for 
user experimentation with configurations upto 4MB of each. 
There is adequate RAM space for up to two page buffers (at 
300 dpi), and program/font space in ROM/EPROM for 
PostScript®, Truelmage, PCL5, combinations of emula- 
tions, or increased font storage. The SX video oscillator can 
also be changed to provide 400 dpi output. 

Benchmarking files can be downloaded to the REAL8 
from any IBM/PC or PC compatible over the standard 
Centronics parallel printer port by simply copying each file to 
LPT1 ; the original source application program does not 
have to be running. 


REAL 8 LASER PRINTER CONTROLLER 




IDT7RS388 Controller Fits Canon LBP-SX Print Engine 
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IDT7RS388 REAL8™ 

LASER PRINTER CONTROLLER DEVELOPER’S KIT 


PRODUCT BRIEF 


DEMO SOFTWARE INCLUDED 

The REAL8 EPROMS contain a demo copy of Microsoft 
Truelmage PDL, TrueType fonts, and the PeerlessPage 
Printer Operating System from The Peerless Group. As 
shipped, the board can be plugged directly into a Canon SX 
engine and can print downloaded PostScript files. This 
provides an easy way to compare the performance of this 
R3000 based controller with controllers based on conven- 
tional technology or other RISC machines. 

Microsoft TRUEIMAGE™ PDL 

Truelmage is an open technology Page Description Lan- 
guage for high performance full-function printers. Truelmage 
provides extensive features such as TrueType™ scalable 
fonts, enhanced communication with the Microsoft Win- 
dows™ operating environment and improved printer 
performance, as well as complete Adobe® PostScript com- 
patibility. Truelmage contains aTrueType rasterizer, as well 
as an Adobe Type I rasterizer, and is able to execute Type 
I fonts. 


PEERLESSPAGE™ Printer OS 

PeerlessPage provides a real-time multitasking execu- 
tive/kernel surrounded by graphics and I/O services such as 
font and emulation soft-switching and active port sensing. 
PeerlessPage is a portable, easily extensible foundation 
that supports multiple industry standards and accelerates 
time-to-market by enabling OEMs to quickly and cost-effec- 
tively build differentiated product lines. 

The PeerlessPage OS itself has been ported to the 
REAL8 board and is included in the board's EPROMs. The 
Truelmage PDL interpreter runs on top of PeerlessPage. 
Other PDLs can easily be ported to the REAL8 by taking 
advantage of the graphics and font handling features built 
into PeerlessPage. 

The PeerlessPage Interface manuals are available di- 
rectly from the Peerless Group, and describe the calls 
available to the user. Users can write C programs with the 
appropriate calls, download the compiled code to the REAL8 
board, and execute it in a software development environ- 
ment. 


\ PCL J _ \ Epson t _ 'Front 7 

\ _IV I Crrtnt ' XL24 * cJLun n ' Panel / 
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PeerlessPage Printer Operating System 
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IDT7RS388 REAL8™ 

LASER PRINTER CONTROLLER DEVELOPER'S KIT 


PRODUCT BRIEF 


ARCHITECTURAL HIGHLIGHTS 

The IDT7RS388 uses a 25 MHz R3001 RISController™ 
with optional (socketed) R3010 Floating Point Accelerator. 
The FPA typically provides 15-20 % performance improve- 
ment in PostScript applications and is not required at all for 
PCL applications. 


The IDT7RS388 assumes that Instruction Cache is al- 
ways used, but provides a jumper for disabling the Data 
Cache. Similarly, junpers allow the ROM/EPROM area to 
accommodate different memory configurations (see Hard- 
ware Reference Manual included with Kit). 



ORDERING INFORMATION 

Each REALS Laser Printer Evaluation System includes a controller board for a Canon LBP-SX engine with 4 MB of DRAM 
and with Peerless Page and T ruelmage in EPROM . Also included is a hardware reference manual describing the design in detail, 
including schematics and PAL equations. 

REAL8 Printer Evaluation Board 7RS388 
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Integrated Device Technology, Inc. reserves the right to make changes to the specification in this data sheet in order to improve design or performance and 
to supply the best possible product. 

MacStation, RISC CPU Subsystem, RISController, REAL8 and TargetSystem are Trademarks of Integrated Device Technology, Inc. 

Apple, Macintosh, AppleTalk, LaserWriter, A/UX, MultiFinderare registered Trademarks of Apple Computer, Inc. UNIX is a registered trademark of AT&T. 
MIPS, RISC/os, and RISCompilerare trademarks of MIPS Computer Systems, Inc. NuBus is a trademark of Texas Instruments, Inc. MS-DOS is a registered 
trademark of Microsoft Corporation. Truelmage and Windows are trademarks of MicrosoftCorporation. TrueType is a trademark of Apple Computer. 
Postscript is a trademark of Adobe Systems. 
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FEATURES: R3000 COMPUTER PLUGS INTO 

• 10 VAX MIPS RISC Computer inside any Macin- A MACINTOSH II: 

tosh II The MacStation 2 is an R3000 based workstation consist- 

ing of a Macintosh II computer and a high-performance 

• Includes UNIX V 3 Operating System R3000 CPU that plugs into the NuBus inside the Mac. The 

R3000 CPU runs IDT/ux, IDT's port of MIPS RISC/os UNIX 

• Supplied with MIPS C-Compiler, Assembler, and V 3, in a window under Multifinder. 

Symbolic Debugger The UNIX window is opened by double-clicking an icon 

on the Mac screen. The window is essentially a terminal 

• Will support any MIPS software packages, emulator. When the window is opened, IDT/ux starts up on 

including SPP, SPP/e, FORTRAN the R3000 CPU. Users can switch back and forth between 

the IDT/ux window and other Macintosh windows freely. 

• Uses all Macintosh peripherals for I/O The system includes MIPS powerful C Compiler. Source 

code is written using any Macintosh text editing program, 

• Multifinder and System 7 Compatible and a single command in the IDT/ux window will copy the file 

into the UNIX file system and compile it. Any other software 
written for MIPS RISC/os can be ported to the MacStation 2. 

The MacStation hardware uses two slots in the 
Macintosh. The MacStation software is supplied on tapes 
and requires approximately 1 60 Mb of hard disc space when 
loaded. 


MacStation 2 
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MacStation™ 2 

IDT7RS502 

R3000 DEVELOPMENT SYSTEM 












IDT7RS502 

MacStation™ 2 Development System 

MacStation 2 HARDWARE 

RISC CPU Card 

The CPU card is a 16 MHz R3000 system, using the 
Floating Point Accelerator and 64 Kb each of instruction and 
data caches. The necessary hardware to run UNIX is also 
on the board. EPROMs on the card contain IDT’s System 
Integration Manager (IDT/sim) which provides many debug 
and control features at the monitor level. The board commu- 
nicates with all I/O via the Macintosh NuBus. 

Main Memory 

The MacStation 2 includes 8 Megabytes of memory on a 
separate NuBus card. This permits reasonably large pro- 
grams to execute without excessive disc swapping. The 
memory may be expanded to 1 6 Mb by simply adding an 8 
Mb extension card, available separately. 


PRODUCT BRIEF 

MacStation 2 SOFTWARE 

The MacStation is shipped with all the software on 
tapes. It may be ordered with a 1 60 Mb hard disc with the 
software pre-installed, but tapes are still included for backup. 


FUNCTIONAL BLOCK DIAGRAM 


Macintosh II 
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IDT7RS502 

MacStatlon™ 2 Development System 


PRODUCT BRIEF 




Screen Shot of Macintosh with IDT/ux running. 
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IDT7RS502 

MacStation™ 2 Development System 


PRODUCT BRIEF 


ORDERING INFORMATION 

The MacStation 2 may be ordered in a variety of forms, ranging from the essential boards and software 
to complete systems with software pre-installed. The IDT/ux operating system requires a signed single- 
system license agreement which must be executed before the system can be shipped. Contact your 
IDT sales office for a sample of the license. 

MacStation Boards 7RS502B8-L 

Includes R3000 NuBus CPU card, 8 Mbyte NuBus Memory card, IDT/ux. 

Requires a Macintosh II computer, system 6.01 or later, Apple tape drive, at 
least 160 Mbytes of free disc space. 

MacStation Conversion Kft 7RS502TD8-L 

Everything to convert a Macintosh II to a MacStation. Includes the 
MacStation boards, IDT/ux, a tape drive, a 160 Mbyte Hard Disc and 
cables. 

Memory Expansion Card 7RS502X8 

Adds 8 Mbytes to the Memory Card, raising total available to IDT/ux to 16 
Mbytes. 

Standard Monochrome System 7RS502MXM-L 

Includes the MacStation Boards, IDT/ux, tape drive, 160 Mb external hard 
disc, cables. Also includes Macintosh I lx computer with 4 Mbytes of internal 
memory and 80 Mb internal hard disc, monochrome 13" monitor and video 
card, extended keyboard. 

High Performance Monochrome System 7RS502MFXM-L 

Includes the MacStation Boards, IDT/ux, tape drive, 160 Mb external hard 
disc, cables. Also includes Macintosh llfx computer with 4 Mbytes of 
internal memory and 80 Mb internal hard disc, monochrome 13" monitor 
and video card, extended keyboard. 

Other Macintosh system configurations. 

The MacStation can be supplied with any desired Macintosh configuration. 

Contact factory for a quotation. 


IDT/ux Documentation 

This is an eight-manual set of documentation for MIPS RISC/os (UNIX V 3 with BSD extensions). 
Included are: RISC/os Programmer’s Reference Manual, User’s Reference Manual, System Administra- 
tion Reference Manual, System Administrator’s Guide, Programmer’s Guide, User’s Guide, Streams 
Primer and Programmer’s Guide, Guide to BSD on RISC/os. 

IDT/ux Documentation Package 7RS551BDU 

Other MIPS Software for MacStation 

The following MIPS products are available for the MacStation. All require asigned license agreement 
prior to shipment. Contact your IDT sales office for a sample of the agreement. 

SPP for MacStation 7RS992SMT-L 

Source Code, site license, includes documentation 

SPP/e for MacStation 7RS993SMT-L 

Source Code, site license, includes documentation 
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R3000 DEVELOPMENT SYSTEM IDT7RS503 

Integrated Device Technology, Inc. 
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FEATURES: R3000 COMPUTER PLUGS INTO 

• 20 VAX MIPS RISC Computer inside any A MACINTOSH II 

Macintosh II 

The MacStation 3 is a high perlormance R3000 based 

• Includes UNIX V 3 Operating System workstation consisting of a Macintosh II computer and an 

R3000 CPU board that plugs into the NuBus inside the Mac. 

• Supplied with MIPS C-Compiler, Assembler, and The R3000 CPU runs IDT/UX . 1DT ' S P ort o{ M1PS RlSC/os 

UNIX V 3, in a window under Multifinder. 

The UNIX window is opened by double-clicking an icon 
on the Mac screen. The window is essentially a terminal 
emulator. When the window is opened, IDT/ux starts up on 
the R3000 CPU. Users can switch back and forth between 
the IDT/ux window and other Macintosh windows freely. 

The system includes MIPS powerful C Compiler. Source 
code is written using any Macintosh text editing program, 
and a single command in the IDT/ux window will copy the file 
into the UNIX file system and compile it. Any other software 
written for MIPS RISC/os can be ported to the MacStation. 

The MacStation hardware uses one NuBus slot in the 
Macintosh. The MacStation software is supplied on tapes 
and requires approximately 1 60 Mb of hard disc space when 
loaded. 


Symbolic Debugger 

• Will support most MIPS software packages, 
Including SPP 

• Uses all Macintosh peripherals for I/O 

• Includes Macintosh-independent SCSI and 
serial I/O ports 

• Multifinder and System 7 compatible 



MacStation 3 


RSD PBx503/A/1 DECEMBER 1990 
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IDT7RS503 

MacStation 3 RISC Workstation 


PRODUCT BRIEF 


MacStation 3 HARDWARE 

RISC CPU Card 

The CPU card is a 25 MHz R3000 system, using the 
Floating Point Accelerator and 64 KB each of instruction and 
data caches. The necessary hardware to run UNIX is also 
on the board. EPROMs on the card contain IDT's System 
Integration Manager (IDT/sim) which provides many debug 
and control features at the monitor level. The board can 
communicate with all Macintosh I/O and with Ethernet via 
the Macintosh NuBus. Additionally, the board has its own 
SCSI and serial I/O ports. A separate terminal can be used 
on the serial I/O port to just run command-line Unix, and an 
independent Macintosh compatible hard disc can be used 


on the SCSI port to provide faster disc access to the RISC 
machine by avoiding delays inherent in the NuBus and Mac 
O/S. 

Main Memory 

The MacStation 3 is available with either 8 or 16 MB of 
DRAM. This permits reasonably large programs to execute 
without excessive disc swapping. 

MacStation 3 SOFTWARE 

The MacStation is shipped with all the software on 
tapes. It may be ordered with a 1 60 Mb hard disc with the 
software pre-installed, but tapes are still included for backup. 
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IDT7RS503 

MacStation 3 RISC Workstation 


PRODUCT BRIEF 




Screen Shot of Macintosh with IDT/ux running. 
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IDT7RS503 

MacStation 3 RISC Workstation 


PRODUCT BRIEF 


ORDERING INFORMATION 

The MacStation 3 may be ordered in a variety of forms, ranging from the essential boards and soft- 
ware to complete systems with software pre-installed. The IDT/ux operating system requires a signed 
single-system license agreement which must be executed before the system can be shipped. Contact 
your IDT sales office for a sample of the license. 

8 MB MacStation System 7RS503B8-L 

Includes R3000 NuBus CPU card with 8 Mbyte RAM, IDT/ux. Requires a 
Macintosh II computer, system 6.01 or later, Apple tape drive, at least 160 
Mbytes of free disc space. 

16 MB MacStation System 7RS503B16-L 

Includes R3000 NuBus CPU card with 16Mbyte RAM, IDT/ux. Requires a 
Macintosh II computer, system 6.01 or later, Apple tape drive, at least 160 
Mbytes of free disc space. 

MacStation Conversion Kit 7RS503TD8-L 

Everything to convert a Macintosh II to a MacStation. Includes the 8 MB 
MacStation board, IDT/ux, a tape drive, a 160 Mbyte Hard Disc and 
cables. 

Other Macintosh system configurations. 

The MacStation can be supplied with any desired Macintosh configuration. 

Contact factory for a quotation. 


IDT/ux Documentation 

This is an eight-manual set of documentation for MIPS RISC/os (UNIX V 3 with BSD extensions). 
Included are: RISC/os Programmer’s Reference Manual, User’s Reference Manual, System Adminis- 
tration Reference Manual, System Administrator’s Guide, Programmer’s Guide, User’s Guide, 
Streams Primer and Programmer’s Guide, Guide to BSD on RISC/os. 

IDT/ux Documentation Package 7RS551BDU 

Other MIPS Software for MacStation 

The following MIPS products are available for the MacStation. All require asigned license agreement 
prior to shipment. Contact your IDT sales office for a sample of the agreement. 

SPP for MacStation 7RS992SMT-L 

Source Code, site license, includes documentation 
SPP/e for MacStation 7RS993SMT-L 

Source Code, site license, includes documentation 



Integrated Device Technology, Inc. reserves the right to make changes to the specification in this data sheet in order to improve design or performance and 
to supply the best possible product. 

MacStation, RISC CPU Subsystem, RISController, and TargetSystem are Trademarks of Integrated Device Technology, Inc. Apple, Macintosh, AppleTalk, 
LaserWriter, A/UX, Multi Finder are registered Trademarks of Apple Computer, Inc. UNIX is a registered trademark of AT&T. MIPS, RISC/os, and RISCompiler 
are trademarks of MIPS Computer Systems, Inc. NuBus is a trademark of Texas Instruments, Inc. RISC/os is a trademark of MIPS Computer Systems, Inc.. 
Ethernet is a trademark of Xerox. VAX is a trademark of Digital Equipment Corporation. OS/2 is a trademark of IBM Corporation. MS-DOS is a registered 
trademark of Microsoft Corporation. 
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IDT/sim 

SYSTEM INTEGRATION MANAGER 
ROMable DEBUGGING KERNEL 

1DT7RS901 





FEATURES: 

• Provides complete control over hardware and 
software for system Integration 

• Fits In 82Kb of EPROM space, plus 16Kb of RAM 

• Provides CPU control for register and memory 
manipulation, cache access, and TLB 
management 

• Includes standard I/O support 

• Easy to add new commands and new I/O drivers. 

• Complete support for MIPS symbolic debuggers. 
No additional code required 

• Supports downloading code in either ASCII or 
binary formats 


POWERFUL TOOL FOR R3000 SOFTWARE/ 
HARDWARE INTEGRATION: 

The IDT7RS901 System Integration Manager (IDT/sim) 
is a ROMable software product that permits convenient 
control and debug of RISC systems built around the MIPS 
R3000 architecture. It permits users to quickly develop and 
debug stand-alone systems. Facilities are included to 
operate the CPU under controlled conditions, examining 
and altering the contents of memory, manipulating and 
controlling R3000 resources (such as cache, TLB and 
coprocessors), loading programs from host machines, and 
controlling the path of execution of loaded programs. Re- 
mote (source/symbolic) debugging is also supported. 

IDT/sim requires 82Kb of EPROM space for code and 
data and 16Kb of ram sapce for uninitialized variable data 
and stack. The minimal I/O system supported uses UARTS. 
The default drivers support the 2681 or 68681 devices. 
Other devices can be added easily. 



HOST SYSTEM 
( MacStation™, MIPS or PC) 


TARGET WITH IDT/sim 
System Integration Manager 
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IDT7RS901 

IDT/sIm SYSTEM INTEGRATION MANAGER PRODUCT BRIEF 


Debug Commands 

Provides commands for CPU control such as execution, register and memory manipulation, cache access 
and TLB management. 

Run Time Support 

IDT/sim provides numerous entry points that client programs may access to perform standard operations. 
These include I/O support (open, close, read, write etc.), standard console I/O support ( get/put char/string, 
printf) and facilities to install new commands and device drivers. 

Remote Debug 

Complete remote (source/symbolic) debug support is provided for operation with MIPS symbolic debuggers 
(DBX). This feature is incorporated into the monitor residing in EPROM so the user does not have to 
download any additional code (ie., dbgmon) to debug code running on his target system. Additionally, an 
easy to use interface is provided so user code that is interrupt driven may be debugged. 

Download Support 

Supports downloading code from a host to the users target machine. Both ASCII and binary formats are 
supported. 


Physical Address 


ifffffff 


lfcOOOOO 


00020000 


00000000 



Virtual Address 

bfffffff 


Used by Monitor 
Available for User Code 


Virtual Address 


bfcOOOOO 


a0020000 


aOOOOOOO 


/ 


/ 


/ 


/ 


/ 


/ 


/ 


/ 


/ 


/ 


/ 



Monitor 

Stack 


USER 

CODE 

SPACE 


Uninitialized 
111 Data Area fi 
(bss) 

Int. Vectors 


a0020000 


aOOleOOO 


a0002800 


aOOOOlOO 

aOOOOOOO 




8.9 


2 
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IDT/sim SYSTEM INTEGRATION MANAGER 


PRODUCT BRIEF 


LIST OF COMMANDS 

rad [-oj-dl-h] 

Set the default radix to the requested base. 
seg [-0I-1I-2M 

Set the default segment to the requested k-segment. 
historyjh 

Displays the last 8 commands entered 
help/? [commandlist] 

Prints a list of the commands available in the monitor. 
regseljrs [-cl-h] 

Selects display format for register names. 
checksum/cs 

Display the checksums for each of the ‘EPROMs’. 
initli 

Initialize prom monitor (warm reset) 
dbgintldi [-el-d>] <DEV> 

Debug interrupt enable/disable - allows ‘break key’ to 
gen extr. int. 

filllf [-wl-hl-bH-r] <RANGE> [value _list.] 

Fills memory specified by range with valuejist.. 

sub [-wl-hl-bl-ll-r] <address> 

This command allows the user to examine and 
change memory interactively. 

dumpld [-wl-h] <RANGE> 

Display contents of memory. 

movelm [-wl-b/-h] <RANGE> <destination> 

Move the block of memory 

comparejcp [-wl-bl-h] <RANGE> <destination> 

Compare the block of memory 

searchjsr [-wl-bl-h] <RANGE> <value> [mask] 

Search the area of memory for a value. 

wc [-i] [-wl-bl-h] <RANGE> [value.Jist] 

Fill cache memory with a pattern. 

cacheflush/cf [-i/-d] 

Flush both the i-cache and the d-cache. 

re [-i] <-w/-bl-h] <RANGE> 

Display cache memory. 

fr <reg#\name> <value> 

Put <value> into the register 

dr [reg#lname] 

Print out the current contents of registers. 
dis <RANGE> 

Disassemble the contents of memory. 


tlbdumpftd [RANGE] 

Dumps the contents of the translation buffer. 
tlbflushjtf [RANGE] 

Flushes the contents of the translation buffer. 

tlbmap/tm [-i index] [-ndgv] <vaddress> <paddress> 
Specify virtual to physical mapping in the translation 
buffer. 

tlbpid/ti [ pid ] 

Displays the current process identifier ( pid ). 
tlbptovltp <physaddr> 

Search the translation buffer for translations which 
map to <physaddr>. 

toad// <_device> [format] 

Down load from an I/O device. 

debug/db [DEV] 

Enter remote symbolic debug mode with host. 

go/g [-n] <address> 

Begin execution at address 

gotilllgt <address> 

Continue execution from the current PC till break 
address encountered 

calllca <address> [argl arg2 ... arg8] 

Invoke a ‘C’ language subroutine. 

step/s [<count>] 

Execute a specified number of instructions. 
contlc 

Continues execution of the client process from where 
it last halted execution. 

brklb [addresslist] 

Display currently set breakpoints or set break point at 
each of the addresses. 

unbrklub <bpnumlist> 

Unset breakpoints listed. 
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IDT/sIm SYSTEM INTEGRATION MANAGER 


PRODUCT BRIEF 


LIST OF RUN TIME SUPPORT ENTRY POINTS 

exitQ 

Returns control to the monitor. 

_atob(str,intptr,base,seg) 

Converts an ascii string to an integer. 

_clear_cache(begin_addr,num_bytes) 

Clears a selected area in I and D cache 

_cli(cmd_table, prompt) 

General purpose command line interpreter 

_flush_cache() 

Flushes both the I and D cache. 

_get_range(str,start, end) 

Parses the range specification. 

jgetcharQ 

Get a character from the standard input device. 
jgets(str) 

Get a string from the standard input device. 

_help(argc,argv,cmd_table) 

Print the usage line for all specified commands 

Jnstall_commands(cmd_table) 

Allows the user to extend the command set of the 
standard monitor. 

Jnstall_immediateJnt(ptrjuserJnt_rt) 

Installs a pointer to a user interrupt function that will 
be called by the monitor when an exception/ 
interrupt occurs. 

_nstall_new__dev(dt jptr.diptr) 

Installs a new device driver that will be recognized by 
IDT/sim 

Jnstall_normalJnt(ptr_user_int_rt) 

Installs a pointer to a user interrupt function that will 
be called by the monitor when an exception/ 
interrupt occurs. 

_ioctl(fd,cmd,arg) 

Sets flags for i/o characteristics and/or calls driver 
ioctl routines. 

_open(device, flags) 

Opens a device for reading and/or writing. 

_printf(format,[args]) 

Formatted print routine 

_j outchar(c) 

Output a character to the standard output device. 
_puts(str) 

Output a string to the standard output device. 
_read(fd,buf,cnt) 

Read data from an external device. 


_reset() 

Resets the monitor 
_setjmp(cur_cntx) 

Save the current context so that non-local goto’s may 
be implemented. 

_longjmp(cur_cntx) 

Restores the saved context so that non-local goto’s 
may be implemented. 

_showchar(c) 

Prints the character passed to it in a visible manner 
_strcat(s,t) 

Concatenate two strings (39) 

_strcmp(s,t) 

Compare two strings (36) 

_strcpy(s,t) 

Copy one sting to another (38) 

_strlen(s) 

Determine the number of characters in a string. (37) 
_tokenize(cmdline,argv) 

Parse command line and build argc/argv structure 

_write(fd,buf,cnt) 

Write data to an external device. 
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1DT7RS901 

IDT/sIm SYSTEM INTEGRATION MANAGER 


PRODUCT BRIEF 


ORDERING INFORMATION 

To order the IDT System Integration Manager, order the Developmental Use License AND order the software on the 
appropriate media. The license will be shipped to you for signature; on return the software will be shipped. You may also 
order binary distribution rights for the run-time version of the monitor. Ask your IDT sales office for information. 

Licenses 

Developmental Use License 7RS901 SLV 

Permits purchase of up to six copies of source code (any media combination) and use of 
source code to develop run-time binaries on up to six machines at a time, but does not 
permit inclusion of the run time code in an end product. 

Binary Distribution Rights 7RS901 BLP-L 

Extension to Developmental Use License to permit inclusion of binary code into end 
product. Development Use License must be referenced on order or ordered 
simultaneously. This license permits up to 100 copies to be distributed royalty -free. 

Additional copies are subject to the royalty below, or a one-time buyout. 

Binary Distribution Sublicense 7RS901 BLC-L 

Per Copy Royalty for distribution of runtimes developed using the System Integration 
Manager beyond the first 100.. 

Maintenance Agreement 7RS901SSY 

One year free minor updates, and discounted upgrade to major update versions. We 
supply a direct telephone contact for support. 


Source Media 

Source for 286/386, MS-DOS 7RS901 SAF-L 

Use with IDT/c C-Compiler (7RS903). Shipped with both 1.2 MB 5.25" and 1.44 MB 3.5" 
diskettes 

Source for 286/386 PC, SCO Xenix 7RS901 SXX-L 

Use with IDT/c C-Compiler (7RS903). Developmental Use License number must be 
referenced on order, or must be ordered simultaneously. 

Source for IDTMacStation, on Mac Disc 7RS901SMD-L 

Use with MIPS C Compiler supplied with MacStation or with IDT/c. Developmental Use 
License number must be referenced on order, or must be ordered simultaneously. 

Source for MIPS or SUN Machine, QIC-24 TAR Tape 7RS901 SUU-L 

Use with MIPS C Compiler or with IDT/c. Developmental Use License number must be 
referenced on order, or must be ordered simultaneously. 

EPROM Versions 

The following versions of IDT/sim are supplied in EPROMs for the indicated hardware. 

These versions are for updating the hardware to the latest version of the monitor. 


For Evaluation boards and Prototyping Systems 7RS901BAP 

Use with 7RS382.383, or any 7RS300 series module Prototyping Platform 

For the MacStation 1 7RS901BBP 

Use with 7RS501 Original MacStation CPU board 

For the MacStation 2 7RS901 BCP 

Use with the 7RS502 MacStation CPU board 


Auxiliary Download Programs 

Fordownloading code from a MIPS machine into IDT/sim. This software includes 
programs to convert MIPS object code into S-records and to download either ASCII or 
binary S-records to a remote target. This software is only needed with MIPS computers; 
all other machines (including the MacStation) have standard utilities available to perform 
thisfunction. 
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MULTI-HOST C-COMPILER SYSTEM 

Integrated Device Technology, Inc. 

1 



FEATURES: 

• Includes C-compIler, Optimizing Scheduler, 
Assembler, and Linker 

• Optional Floating Point Software 

• Meets Plum Hall 2.00 ANSI C validation suite 

• Runs on 80286 and 80386 machines under MS- 
DOS™ or XENIX™, on MIPS machines under 
RISC/os, and on MacStation™ under IDT/ux 

• Supports entire IDT family of MIPS ISA Proces- 
sors: R3000, R3001, R3051, and R3052 

• Loader communicates with IDT's System Inte- 
gration Manager (IDT/sim) 

• Provides control over multiple memory 
segments 


OPTIMIZING C-COMPILER SYSTEM: 

IDT/c consists of a set of software products that run on a 
variety of platforms, and which together produce highly 
efficient code for R3000 CPUs. The code can be down- 
loaded in several formats to a target machine for execution. 
On the target machine, the code can be controlled with IDT’s 
System Integration Manager (IDT/sim). 

The compiler is based on the popular GNU C compiler, 
and is fully compliant with ANSI C. 

The entire package is available for execution on 286 or 
386 machines under MS-DOS or XENIX, as well as the MIPS 
and SUN workstations, and IDT’s MacStation single user 
workstation. 

For any platform, IDT/c can be ordered with or without a 
software floating point library. A switch in the compiler 
determines if floating point instructions will result in R3010 
instructions in the object code or whether calls to the floating 
point library will be made instead. 
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1DT7RS903 

IDT/c R3000 C-COMPILER 


DESCRIPTION: 

The IDT/c C-Compiler System is a complete develop- 
ment package for CPUs based on the R3000 architecture. 
It contains an optimizing cross compiler, scheduleroptimizer, 
cross assembler, linker, and a downloader. The ‘C’ compiler 
is compliant with ANSI ‘C’ standard and performs the opti- 
mizations available in state of the art ‘C’ compilers. The 
assembler supports the R3000 machine instructions and 
architecture described in the book by Gerry Kane, “MIPS 
RISC Architecture”, including both native and synthetic 
instructions. The complete IDT/c package runs on a variety 
of host machines and operating systems and is part of IDT’s 
cross development system tools which include other pack- 
ages such as debug monitors and libraries. 

Compiler 

The C pre-processor is GNU cpp and the compiler itself 
is based on GNU C. All C-preprocessing features are 
supported. The combination of the compiler and assembler 
included in IDT/c has been tested for compliance to the 
ANSI ‘C’ standard using the Plum Hall test suite and is 
compliant. C programs written forthe MIPS C compiler may 
also be compiled without modification. 

The C compiler performs extensive optimization in mul- 
tiple passes through the code. Each of the many optimiza- 
tion techniques can be individually switched on or off with 
compiler directives. 

Optimizing Scheduler and Assembler 

The IDT cross assembler input is compatible with source 
code written for the MIPS assembler. It implements the 
R3000 native instruction set as well as the augmented 
synthetic instructions defined in the “MIPS RISC ARCHI- 
TECTURE” book by Gerry Kane. There are some exten- 
sions in the IDT cross assembler that provide the program- 
mer with more control over code generation, such as ‘laiu’ - 
load address upper and ‘oria' - load address lower, enabling 
direct programming in pure assembly language. The as- 
sembler produces .o files which are later linked togetherwith 
other files to produce an executable file. 

The scheduler first expands the synthetic instructions 
into the native instruction set. It then rearranges code to 
allow for and take advantage of R3000 pipeline architecture. 
At the same time the scheduler analyzes loads of static 
constants and makes use of previously loaded constants 
that are close in value. 


PRODUCT BRIEF 


Memory description file 

The memory description file is used to instruct the linker 
where to place object modules in the R3000 memory map. 
It tells linker what address classes are legal, what addresses 
exist within those classes, and what addresses should be 
written to output files. The file consists of a sequence of 
class specifications (CODE, DATA, etc.) and associated 
address ranges. 

Linker 

The linker combines together separately assembled pro- 
gram files into one object module. Command line switches 
may be used to override the memory description file. 

The format of object code produced by the assembler in 
IDT/c is not compatible with the format produced by the 
MIPS assembler, so modules compiled by the MIPS soft- 
ware cannot be linked directly with modules compiled by 
IDT/c. Recompilation under IDT/c is required. 

There are three types of output file formats supported: S- 
Records, INTEL hex, and binary image. The S-Record files 
are useful in down-loading to target boards. The INTEL hex 
format files are useful for EPROM programing because the 
linker provides for the code to be divided into multiple files 
under this format. 

Endianness 

IDT/c is Big-Endian. 

Floating Point Library 

IDT/c may be ordered with a floating point library. A 
switch in the compiler is set at compile time to determine 
how the compiler should handle floating point instructions. 
In the the normal mode, it will produce R301 0 Floating Point 
Accelerator instructions in the object code. If the switch is 
set the other way, the compiler will insert calls to the floating 
point library instead, and the floating point library must be 
lavailable at link time. Because the compiler knows about 
the libary during compile time, it can perform optimizations 
not otherwise possible and keep the execution penalty for 
using software instead of hardware to about a factor of 4 in 
very fp intensive code. 
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IDT/c R3000 C-COMPILER 


PRODUCT BRIEF 


OPTIMIZATION PASSES 

Multiple optimization passes are performed by the GCC compiler. Below is a brief description of what takes place on 
each pass. Note that switches can be used in the compiler to turn individual optimization choices off or on, providing the 
programmer with a great deal of control over how the compiler modifies the code. 

Jump optimization 

Simplifies jumps to the following instruction, jumps across jumps, and jumps to jumps; while deleting unreferenced 
labels and unreachable code. 

Register Scan and common subexpression elimination 

Finds first and last use of each register for purposes of subexpression elimination while performing constant propa- 
gation. 

Loop optimization and strength reduction 

Moves constant expression code outside of dynamic loop. 

Data flow analysis 

The program is divided into basic blocks and identifies the life of values in registers. Once done, then code 
producing unused results can be eliminated and unreachable loops are eliminated. 

Local register allocation 

Allocates registers to be used inside each basic block. 

Global register allocation 

Assigns registers for values which live across basic block boundaries. 

Final Pass 

The final pass is to generate assembler code. At this point peephole optimizations are performed as well as 
generating and optimizing the function entry and exit code sequences. 


PERFORMANCE COMPARISONS 


Execution 

To obtain a measure of the efficiency of the IDT/c com- 
piler, a set of benchmark programs was compiled under 
both IDT/c and the MIPS compiler, and the size and execu- 
tion time of the resulting binaries were compared. 


Execution Time Comparison 



Code Size 

Exec. Time 

Compiled with MIPS C 

1.0 

1.0 

Compiled with IDT/c 

1.20 

1.19 


Compile Time 

The time required to compile a program under IDT/c 
depends on the machine speed, type, and configuration. 
For comparative purposes, the Stanford benchmark was 
compiled under a variety of hosts and the results are shown 
below. For reference, the same program was also compiled 
using the MIPS compiler. 


Compile Time Comparisons 

Host 

Compile Time 

MIPS C on MIPS Machine 

24 sec. 

IDT/c on MIPS Machine 

25 sec. 

IDT/c on 10 MHz 286, MS-DOS 

695 sec. 

IDT/c on 25 MHz 386, Xenix 

70 sec. 
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IDT/c R3000 C-COMPILER 


PRODUCT BRIEF 


COMMAND LINE SWITCHES 


ASSEMBLER DIRECTIVES 


-E : 

pre-process only. .S file is expected. Pre-processed 
file is written to the standard output. 

-O : 

Optimize (GNU cc -O option). 

-01 : 

Optimize even more (GNU cc options: -fstrength- 
reduce -fforce-addr -fforce-mem -fcombine-regs 
-finline-functions). 

-c : 

Assemble only, do not link. Expected are filenames 
with .s or .S suffixes. Output files (in absence of -o) 
will have .0 suffix. 

-ZA : 

produce assembly listing. 

-o xxx : 

Name output file. The default output name is 
‘out.sre’. 

-ZL : 

Produce link map. 


.align n, n=1-3 

align so that n least significant bits of address are 0. 

.ascii “string” 

assemble string. 

.bss 

store following into bss section. 

.byte arg.arg arg 

assemble arguments into consecutive bytes. 

.data 

store following into data section. 

.end name 

Included for MIPS asm compatibility but ignored. 

.ent name 

Included for MIPS asm compatibility but ignored, 
.extern name,n 

Import symbol ‘name’ that refers to n bytes of storage 
. (included for MIPS asm compatibility and ignored). 

.globl name 

export defined symbol. 


-Fxxx.xxx : 

Use xxx.xxx as memory layout description file. In 
absence of -F option the default is to use file idt.mem 
in default library directory. 

-ZThhhhhhhh : 

Specify text loading address, hhhhhhhh is address in 
hex, up to 8 hex digits. This will override .mem file 
definitions. 

-ZDhhhhhhhh : 

Specify data loading address, hhhhhhhh is address 
in hex, up to 8 hex digits. This will override .mem file 
definitions. 

-e name : 

Use global ‘name’ as program start address. 

-noenv : 

Do not include default library modules which define 
the order of program sections and global symbols 
that point to beginning and end of text, data and bss. 

-nostdlib : 

Do not include library for linking with IDT PROM 
monitor. 


.half arg.arg arg 

assemble arguments into consecutive halfwords. 

.set argument 

argument can be : 

at - error flag every use of $1 . 

noat - disable errors due to user’s usage of $1 
(at). 

reorder - enable scheduling to resolve pipeline 
conflicts. 

noreorder - disable scheduling. 


.space n 

skip next n bytes, advancing location counter by n. 


.text 

store following into text section. 

.word arg.arg.-.-.arg 

assemble arguments into consecutive words. 



SEGMENT 

The SEGMENT directive selects the address seg- 
ment where the following code or data will be stored. 
It is used to implement ‘.text’, ‘.data’ and ‘.bss’ which 
are MIPS compatible segments. Using this directive 
the user can create other custom segments. 
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PRODUCT BRIEF 


ORDERING INFORMATION 

The IDT/c C-Compiler is an efficient R3000 C-compiler system based on the popular GNU C and hosted on a variety 
of computers. The IDT/c system includes the compiler, assembler, scheduler and linker. All PC versions of the software 
are shipped with both 1 .2 MB floppy discs and 1 .44MB 3.5" diskettes. A "boxtop" single user license is included with the 
product. 


Media, without Floating Point 

The software listed below does not include the floating point library. 

For 286 machine, MS-DOS 7RS903BAF-N 

Not recommended for large, complex programs. At least 2MB RAM recommended. Requires DoOS version 
3.3 or greater. 

For 386 machine, MS-DOS 7RS903BBF-N 

Note: as of 1 0/1/90, this product is the same as the 286 MS-DOS version, but a performance enhancement is 
planned for early 1991. Registered users of this version of the software will automatically receive the enhanced 
software as soon as available. 

For 286 machine, SCO Xenix 7RS903BYX-N 

For 386 machine, SCO Xenix 7RS903BXX-N 

For MIPS machine RISC/os, on QIC-24 TAR Tape 7RS903BUU-N 

For MacStation, on Macintosh Tape 7RS903BMD-N 

Runs on MacStation R3000 board under IDT/ux. 

For SUN Sparcstatlon, on QIC-24 TAR tape 7RS903BWU-N 


Media, with Floating Point Library 

The software listed below includes the floating point library. 

For 286 machine, MS-DOS 7RS903FBAF-N 

Not recommended for large, complex programs. At least 2MB RAM recommended. Requires DoOS version 
3.3 or greater. 

For 386 machine, MS-DOS 7RS903FBBF-N 

Note: as of 1 0/1/90, this product is the same as the 286 MS-DOS version, but a performance enhancement is 
planned for early 1991. Registered users of this version of the software will automatically receive the enhanced 
software as soon as available. 

For 286 machine, SCO Xenix 7RS903FBYX-N 

For 386 machine, SCO Xenix 7RS903FBXX-N 

For MIPS machine RISC/os, on QIC-24 TAR Tape 7RS903FBUU-N 

For MacStation, on Macintosh Tape 7RS903FBMD-N 

Runs on MacStation R3000 board under IDT/ux. 

For SUN Sparcstatlon, on QIC-24 TAR tape 7RS903FBWU-N 

Floating Point Upgrade 

The version of the compiler without floating point may be upgraded to add the floating point library. To upgrade, contactyour 
IDT sales office. Indicate the order code and serial number for your original software on your order, so we will ship the correct 
format. 


Floating Point Upgrade 7RS905BZU 

Maintenance 

Maintenance 7RS903BSY 


Includes free upgrades for one year and direct telephone contact for support 


8.10 


6 





CROSS ASSEMBLER IDT7RS904 

FOR IBM PCs AND CLONES 


FEATURES: 

• C-LIke Pre-processor allows programmer to 
extend Instruction set 

• Provides for separate code segments loaded at 
different memory locations 

• Outputs S-records and Intel HEX 

• Provides symbolic and hex instruction listing 

• Lists Absolute addresses of code segments and 
symbols 


DESCRIPTION: 

IDT7RS904 is a cross assembler with a C-like 
pre-processor and linker that produces Motorola S-record 
or INTEL HEX downloadable files and a downloader. It is 
intended for cross-development with R3000 as target 
architecture. The assembler is compatible with files written 
forthe MIPS assembler. The assembler supportsthe R3000 
machine instructions and architecture descibed in the book 
by Gerry Kane, “MIPS RISC Architecture”. The cross 
assembler package runs on variety of host machines and 
operating systems. The pre-processor, assembler and 
linker are invoked by a single driver named “asr3k”. 


• Interfaces to IDT System Integration Manager 
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R3000 CROSS ASSEMBLER FOR PCs 


PRODUCT BRIEF 


PRODUCT FEATURES 

Pre-processor 

The C-like pre-processor is provided to process assem- 
bly language files such that the programmer can extend the 
instruction set of the cross assembler. All C- preprocessing 
features are supported. 

Assembler 

Assemblers for R3000 differ from ‘classical 1 assemblers 
because they attempt to transparently provide the richer 
instruction set. This is due to the RISC architecture, which 
provides only essential instructions and achieves high per- 
formance in return. In order to enable higher efficiency in 
assembly-language programming, assemblers introduce 
instructions that do not directly map into the machines 
instruction set. In this way programmers in effect can 
program with a richer instruction set than that of the proces- 
sor. 

The 7RS904 is compatible with the MIPS assembler. It 
implements the R3000 instruction set as defined in the 
“MIPS RISC ARCHITECTURE” book by GERRY KANE 
(appendix D). With a few exceptions programs written for 
the MIPS assembler will directly compile. The cross assem- 
bler varies from the MIPS assembler in that it requires 
labels, if present, to start in column 1 and operators to start 
in a column greater than 1 . 

Linker 

The linker combines together, seperately assembled 
program files into one object module. It uses information 
from the memory description file to determine where the 
object module will be placed in the R3000 address map. 
Command line switches may be used to override memory 
description file. 

Memory DescriptionFile 

The memory description file tells linker what address 
classesare legal, what addresses exist withinthose classes, 
and what addresses should be written to output files. The 
memory description file consists of a sequence of class 

ORDERING INFORMATION 

7RS904 Cross Assembler for PCs 

This is a macro-assembler that runs on any PC running 
MS-DOS. A single user license is included. 


specifications (CODE, DATA, etc.) and associated address 
ranges. In this way the linker can be instructed to place 
code at different locations. 

The linker can generate three different types of output 
files. The file type is controlled from the memory desricption 
file, by placing a FILETYPE specification in the output file 
specification line. There are three choices: DUMP - Gener- 
ates single .sre S-recordfile. (default), INTEL - Intel hex file 
format, BINARY - Generates a binary image of the file. 

DownloadableFiles 

The linker can output S-record files (.sre), in which 
individual S- records are lines of text as interpreted by the 
host system conventions, ie. they are separated by line 
delimiters. The ‘dl* program can convert files to S-records 
with no intervening delimiters. 

EPROM Files 

The linker can output INTEL hex files for the purposes of 
burning EPROMs. The linker may be instructed to divide the 
output code and data segments into multiple files to 
accomodate seperate sockets and address spaces. 

Extentions 

There are some extensions in the cross assembler that 
provide the programmer with more control over code gen- 
eration, such as ‘lau’ - load address upper and ‘laP - load 
address lower, enabling direct programming in pure assem- 
bly language. 

lal reg.addr Load Address Lower. Loads 

(using ori) lower 1 6 bits of 
address into register. Ad- 
dress is relocatable symbol 
(can be external). 

lau reg.addr Load Address Upper. Loads 

(using lui) upper 16 bits of 
address into register. The 
lower 16 bits are cleared. 
Address is relocatable sym- 
bol (can be external). 


For PC/XT, MS-DOS, on 320 Kb Floppy 

7RS904BPL-N 

For PC/AT, MS-DOS, on 1.2 Mb Floppy 

7RS904BAF-N 
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Integrated Device Technology, Inc. 


IDT/fp 

FLOATING POINT LIBRARY 

FOR USE WITH R3000 COMPILERS 

ADVANCE 

INFORMATION 

IDT7RS905 




FEATURES: 

DESCRIPTION: 



• Conforms to IEEE 754 format The IDT7RS905 product is a floating point (IEEE-754) 

arithmetic library for use in IDT79R3000 systems that do not 

• Converts ASCII to/from Floating Point incorporate a IDT79R3010 floating point coprocessor. It 

provides the basic single and double precision arithmetic 

• Converts Integer to/from Floating Point functions (add, subtract, multiply and divide) as well con- 

version routines between different precisions, integer and 

• Adheres to IEEE 754 Error Handling ascii formats. The IEEE-754 single precision floating point 

format represents numbers ranging from ±1.2E-38 to 

• Works with IDT/c on all platforms +3.4E+38 with 24 bit mantissa precision. The double 

precision format offers a range of ±2.2D-308 to ±1 .8D308 
with a 53 bit mantissa. The accuracy of the floating point 
library is within one least significant bit. The IEEE floating 
point format defines special representations for underflow 
(result = zero), overflow (result = + INF or - INF), and invalid 
operation (result = Not a Number, NaN). The floating point 
library adheres to the IEEE-754 error handling procedure in 
all applicable cases. 


SUPPORTED OPERATIONS 


Addition 

FPADD(a.b) & DPADD(a,b) 

Subtraction 

FPSUB(a.b) & DPSUB(a,b) 

Multiplication 

FPMUL(a,b) & DPMUL(a,b) 

Division 

FPDIV(a.b) & DPDIV(a.b) 

Comparision 

FPCMP(a.b) & DPCMP(a,b) 

Integer to FP 

FPFLT(int) & DPFLT(int) 

FP to Integer 

FPINT(sp) & DPINT(dp) 

DP to SP 

DPTOSP(dp) 

SP to DP 

SPTODP(sp) 

ACS II to FP 

FASCBIN() & DASCBINQ 

FP to ASCII 

FBINASCQ & DBINASC() 
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IDT7RS905 

IDT/fp FLOATING POINT LIBRARY 


PRODUCT BRIEF 


ORDERING INFORMATION 

This software contains a library of floating point routines that perform functions equivalent to the R3010 Floating Point 
Accelerator chip. The routines can be linked with output from any of the compilers listed below, and are ROMable. The 
execution time is roughly 30 times that required by the R301 0. The Developmental Use License and copies of the software 
in one or more media should be ordered separately. The license must be signed before the software is shipped. 

Developmental Use License 7RS905BLV 

Permits purchase of up to six copies of the software in any media, and permits up to six copies 
running in developmental machines, but does not permit inclusion of code in an end product. 

Binary Distribution Right 7RS905BLP-L 

Permits unlimited, royalty free distribution of run-times containing the Floating Point Library . 

For use with 7RS903BAF-N (PC) 7RS905BAF-L 

For use with MacStatlon 7RS905BMD-N 

For use with MIPS machine, RISC/os 7RS905BUU-L 

For use with 7RS903BXX 

(386 Xenix Disc) 7RS905BXX-L 

For use with 7RS903BYX 

(286 Xenix Disc) 7RS905BYX-L 

Binary Distribution Rights 7RS905BLP 

Permits royalty-free distribution of mn-time code. 

Maintenance 7RS905BSY 

One year free updates 
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THIRD-PARTY DEVELOPMENT TOOLS AND 
APPLICATIONS SOFTWARE FOR IDT RISC 
PROCESSORS 




OVERVIEW 

The MIPS/IDT RISC Microprocessorfamily is supported by 
a wide variety of third-party development tools and applica- 
tions software. Many of these tools are software products, 
useful across the entire line of processors; others of these are 
hardware development tools, appropriate for one or two 
members fo the family. 


As the MIPS architecture is increasingly popular and suc- 
cessful, many new tools are constantly being announced. IDT 
encourages our customers to work closely with their local 
sales representative for a current list of third party support. 
This listing is intended to be reasonably current as of the data 
of this document. 


Type Vendor 

Product Name 

Phone 

Logic Analyzer 

Arium 

ML4400 

(714)-978-9531 
(800) 862-7486 

Biomation 

CLAS4000 

(800) 538-9320 
(408) 988-6800 

Hewlett Packard 


local HP sales office 

Tektronix 

DAS9200 

(800) 426-2200 
(503) 627-7111 

In-Circuit Emulator 

Embedded Performance 


(408) 980-8833 

Hardware Simulation Models 

Mentor Graphics 

HML 

(800) 547-7390 
(503) 626-7000 

Logic Modeling 

Systems 


(408) 957-5276 

Software Simulation Models 

Logic Automation 

R4000 Sim 

(503) 690-6900 

Ada Development on Vax host 

InterACT 


(212) 696-3700 

Telesoft 


(619) 457-2700 

VME Board 

Creative Electronics, Switzerland 

(022) 792 57 45 
(022) 792 57 48 (fax) 

Lockheed Sanders, Inc. 


(603) 885-6022 

Omnibyte 


(708) 231-6880 

Real-time Operating Systems 

JMI 

C-EXECUTIVE™ 

(215) 628-0846 

Lynx OS 

Lynx™ 

(408) 354-7770 

Wind River Sys. 

VxWorks™ 

(415) 748-4100 

Laser Printer Languages 

Adobe Systems 

Postscript™ PDL 

(415) 961-4400 

Microsoft 

Truelmage™ 

(206) 882-8080 

Peerless Group 

PeerlessPage™ OS 

(213) 536 0908 
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By Michael J. Miller 


The goal of every computer architecture is to decrease the time it 
takes to execute a task. With RISC the goal is no different, but the 
way this is achieved is different than previous architectures such as 
CISC. To achieve this desired goal, many aspects of CPU design 
must be addressed. One of the techniques used in RISC to achieve 
a faster execution rate is to implement a reduced instruction set to 
gain shorter clock cycles. However it is not sufficient to just speed 
up the cycle time of the processor. One must pay attention to how to 
best feed the streamlined processor. In other words, the architect 
needs to solve the problem of how to support the increased 
instruction and data bandwidth required by the CPU. Beyond 
speeding up the datapath, the architect must also address issues 
such as how to achieve efficient exception handling, fast context 
switching, memory management and fast Floating Point operation. 
For further performance, the architect must also address how to tie 
together multiple copies of this architecture to achieve ever 
increasing power. This application note will concentrate on how 
these issues are addressed by the IDT79R3000. 

The Performance Equation 

The measurement of performance is the time it takes to complete 
a task, (which is a product of the number of instructions to be 
executed for the task, the number of cycles per each instruction and 
cycle time (Figure 1 .) ) The architects of some of the most recent 
generation of CPUs (CISC) have spent most of their time 
addressing the number of instructions per task. By increasing the 
complexity of the instruction set, the number of instructions to 
execute the task was decreased. However, each term of the basic 
performance equation is not independent from the others. In 
increasing the complexity of the instructions, the number of cycles 
to execute the instructions increased. The RISC philosophy is to 
roll-back the complexity of the instructions and reduce the cycles 
per instruction. This reapportions the resources of the silicon to 
execute more instructions in parallel as well as include structures to 
control cache and memory management. 

TIME INSTRUCTIONS CYCLES TIME 

TASK ~ TASK X INSTRUCTION X CYCLE 

Figure 1. Performance Equation 

The first term of the performance equation is the number of 
instructions per task. Whether it be RISC or CISC, the number of 
instructions to be executed can be minimized by optimizing 
compilers that have come into their own in the last 6 years. RISC 
strives to take advantage of these optimizing compilers by 
including simple register to register instructions which can be 
utilized more efficiently by the compiler. Additionally, it takes less 
time to operate on data already in the register than it does to go 
external to the CPU. The RISC philosophy is to include a large 
number of registers on the CPU chip and to require the compiler to 
make efficient use of the registers through effective allocation. 
Register allocation is an operation that allows optimizing compilers 
to achieve some of the biggest gains in performance. 

In choosing those instructions to be included in the architecture, 
the designers of RISC analyzed the typical programs executed on 


the CPU. It was found that about 90% of all instructions executed in 
a task were simple loads and stores, ALU operations and branches. 
If the architects could speed up these instructions, they could make 
a significant performance gain. The other 10% of instructions were 
more complex operations. Previous generations of CPUs used 
more of the si I icon for this 1 0 % than they used for the 90% of si mple 
operations. It was found that few architectural additions could be 
made to speed up this 10% of the instruction space without slowing 
the execution of the other 90% of the instructions. Therefore, RISC 
design focuses in the area that promises the biggest gain, i.e. the 
simpler instructions. 

Once the architects isolated the key instructions to speed up, 
they could minimize the cycles that it took to execute those 
instructions. Pipelining was employed so that on every clock cycle 
a new instruction could be started before the previous one 
completed. Although the goal of RISC is to execute one instruction 
on every clock cycle, not all instructions can be completed on one 
clock cycle. Some instructions have a latency effect. These 
instructions typically are branches, loads and stores. Therefore, the 
task is to remove the latency as much as possible in these 
instructions to achieve the single-cycle execution goal. These 
latencies are mainly caused by the external memory pipelines for 
loads and branches. When branching, the way the test conditions 
are determined has the most affect on the latency. Fast RISC CPUs 
therefore, only compare operands for equality or against zero to 
eliminate carry/propagate chains in ALUs. This solution simplifies 
the architecture for increased speed without compromising the 
power. 

The third part of the equation is to maximize the clock speed. 
This is done through improving the process technology that is used 
to implement the architecture, (such as bipolar, NMOS or CMOS), 
as well as the implementation of the architecture in it’s logical form. 
As the internal delays are decreased and clock cycles shortened, 
the impact of external paths becomes greater. With RISC, the 
critical path external to the CPU is through the memory. To speed up 
the memory access for systems with big main memories 
implemented in DRAM, caches have been employed. The access 
time of the cache is determined by the organization and the 
implementation of the tag checking. Therefore, caches are integral 
to supporting the increased clock speed of RISC in big systems, 
but the caches alone cannot keep the CPU supplied. 

The Memory Hierarchy 

Programs and data are stored in DRAMs and secondary storage 
devices like hard disks which have slow access times in 
comparison with a cycle time of the ALU. The total RISC system 
implementation utilizes a memory hierarchy where each level ofthe 
hierarchy is, in effect, a cache for the next level down. The top of the 
hierarchy is a very small fast cache (register file) while the bottom 
level (main memory) is slow and very large. As an example, to 
support 20 MIPS operation, the CPU needs at least 200 Mbytes/sec 
of instruction and data bandwidth while main memory typically 
only supports 28 Mbytes/sec (see Figure 2). Previous CPUs, like 
the 68020, only required 15 Mbytes/sec to support 1.5 MIPS of 
performance. 
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HIGHEST BUS BANDWIDTH 
(NATIVE RATE OF THE CPU) 


25MHZ CPU 
200 MBYTES/SEC BUS 


25 MHZ 

100 MBYTES/SEC 


12 MHZ 

48 MBYTES/SEC 


LOWEST BANDWIDTH 
(WHERE THE PROGRAM RESIDES) 


Figure 2. The Memory Hierarchy 


The first, level of the hierarchy is the register file that keeps the 
ALU fed with data. The management of the registerfile is performed 
at compile time. Without optimizing compilers, the total power of 
RISC cannot be realized. 

The second level of the hierarchy is the instruction and data 
caches. Because RISC CPUs require a new instruction every clock 
cycle, the access time of the cache SRAM is the pacing delay in a 
RISC CPU’s cycle time. To further increase the bandwidth into the 
RISC CPU, a separate cache for data is used. 


The third level hierarchy is the main memory consisting of cache 
hard disks. Today’s RISC CPUs employ on-chip a memory 
management unit (MMU) to manage the swapping of program 
segments between main memory and the disk. The Translation 
Lookaside Buffer (TLB) is a key element in the MMU that translates 
the logical addresses to the physical address of the programs 
stored in main memory. The TLB is a look-up table of addresses. 
Since circuitry to implement TLBs is costly in silicon area, TLBs are 
kept small and function as another cache for a large master look-up 
table kept in main memory. 
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HIERARCHY 


MANAGEMENT 

TABLES 


MANAGEMENT CONTROL 


REGISTER 


OPTIMIZING COMPILER 



MAIN 


VIRTUAL 





CACHE 

TAGS 



SPEED: IF MISS BLOCK REFILL 
HARDWARE, LATENCY TO 1ST 
WORD DETERMINED BY SYSTEM 


SPEED: IF MISS SOFTWARE EXCEPTION 
9 INSTRUCTIONS, ONE LOAD. MORE 
EXOTIC SCHEMES CAN BE ACCOMMODATED 


SPEED: IF MISS SOFTWARE CONTROLLED, 
DISK ACCESS, PAGE TABLE UPDATE 


Figure 3. Managing the Memory Hierarchy 


More than a Data Path 

Therefore, RISC CPUs need not only a datapath, but also a 
memory hierarchy that must be managed by the CPU and software 
in order to operate efficiently. All levels of the hierarchy are used to 
feed the insatiable appetite of the instruction and data path of 


today’s RISC CPUs. Since the reduced instruction set is only one 
aspect of the RISC CPU technique, perhaps a more appropriate 
name would be Bandwidth Increased Streamlined Computer 
(BISC). This new term could be used at the 'rise 'of adding more 
letters to todays’ already full bowl of alphabet soup! 
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By Satyanarayana Slmha 


INTRODUCTION 

The exception processing capability of the IDT79R3000 is 
provided to assure an orderly transfer of control from an executing 
program to the supervisor program. Exceptions may be broadly 
divided into two categories: those caused by an instruction, 
including an unusual condition arising during its execution, and 
those caused by external events such as interrupts. When an 
IDT79R3000 detects an exception, the normal sequence of 
instruction is suspended; the processor exits User mode and is 
forced to the Kernel mode where it can respond to the abnormal or 
asynchronous event. This paper presents an overall view of the 
types of exceptions in the R3000 and the exception handling 
registers. It then describes one specific exception, namely 
interrupts, the latency associated with it and gives an example of 
code on how to handle an interrupt event. 

EXCEPTION PROCESSING 

The R3000’s exception handling system efficiently handles 
machine exceptions, including Translation Lookaside Buffer (TLB) 
misses, arithmetic overflows, I/O interrupts, system calls, 
breakpoints, reset, and coprocessor unusable conditions. All of 
these events interrupt the normal execution flow. The R3000 aborts 


the instruction causing the exception and also aborts all those 
following in the exception pipeline which have already begun 
execution. The R3000 then performs a direct jump into a 
designated exception handler routine. 

When an exception occurs, the R3000 loads the EPC (Exception 
Program Counter) with an appropriate restart location where 
execution may resume after the exception has been serviced. The 
restart location in the EPC is the address of the instruction causing 
the exception. If the exception occurred in a branch delay slot, the 
EPC contains the address of the branch instruction immediately 
preceding the delay slot. 

EXCEPTION HANDLING REGISTERS 

The system coprocessor (CPO) registers contain information 
pertinent to exception processing. Software can examine these 
registers during exception processing to determine such things as 
the cause of an exception, and the state of the CPU at the time of an 
exception. There are six registers handling exception processing 
(shown in Figure 1). These are the Cause register, the EPC register, 
the Status register, the BadVAddr register, the Context register, and 
the Prld register. A brief description of each follows. 



Figure 1. The Exception Handling Registers 

The Cause Register: register with the exception of the Sw bits are read-only. The Sw bits 

The contents of this register describe the last exception. A 4-bit n written into in order to set or reset software interrupts. See 

exception code indicates the cause. The remaining fields contain Figure 2. 

detailed information specific to certain exceptions. All bits in this 
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31 0 




CE 

0 

IP[5..0] 

Sw 

0 

ExcCode 

Hoi | 

1 

1 

2 

12 

6 

2 

2 

4 

2 


BD : Branch delay. 

CE : Coprocessor Error 
IP : Interrupts Pending 
Sw : Software Interrupts* 

*: Read and Write. The rest are Read-only 


ExcCode : Exception Code Field 



: Reserved 


Figure 2. The Cause Register 


The EPC (Exception Program Counter) Register: 

The 32-bit register contains the address where processing can 
resume after an exception has been serviced. This register 
contains the virtual address of the instruction that caused the 
exception. When the virtual address of the instruction resides in a 
branch delay slot, the EPC contains the virtual address of the 
instruction immediately preceding— which is the Branch or Jump 
instructions. 

Bad VAddr Register: 

The Bad VAddr register saves the entire bad virtual address for 
any addressing exception. 


Context Register: 

The Context register duplicates some of the information in the 
BadVAddr register, but provides the information in a form that may 
be more useful for a software TLB exception handler. 

The Status Register: 

This register contains all the major status bits. Any exception 
puts the system in Kernel mode. All bits in the status register, with 
the exception of the TS (TLB shutdown) bit are readable and 
writeable; theTS bit is read-only. Figure 3 shows the functionality of 
the various bits in the status register. 


31 28 

27 23 22 

21 

20 

19 

18 

17 

16 

15 8 

7 

6 

5 

4 

3 

2 

1 

0 

CU 

(Cu3. . .CuO) 

HI 

BEV 

TS 

PE 

CM 

» 

SwC 

IsC 

IntMask 

Intr5. . . 0 Swl .0 

0 

KUo 

lEo 

KUp 

lEp 

KUc 

lEc 

4 

5 

1 

1 

1 

1 

1 

1 

1 

8 

2 


1 

1 

1 

1 

1 

1 


CU : Coprocessor Usability 

BEV : Bootstrap Exception vector 

TS : TLB shutdown 

PE : Parity Error 

CM : Cache Miss 

PZ : Parity Zero 

SwC : Swap Caches 

IsC : Isolate Cache 


IntMask : Interrupt Mask 
KUo : Kemel/User mode, old 
lEo : Interrupt enable, old 
KUp : Kemel/User mode, previous 
lEp : Interrupt enable, previous 
KUc : Kemel/User mode, current 
lEc : Interrupt enable, current 
O : Reserved 


Figure 3. The Status Register 


Processor Revision Identifier Register: 

This 32-bit register contains information that identifies the 
implementation and revision level of the Processor and System 
Control Co-Processor. 

EXCEPTION VECTOR LOCATIONS 

The R3000 uses three different addresses for exception vectors: 

• The RESET exception vector is at address OxbfcOOOOO 

• The UTLB Miss exception vector at address 0x8000000 

• The General exception vector for all other exceptions at address 
0x80000080 


LATENCY FOR EXCEPTION PROCESSING 


Different types of exceptions can occur in different stages of the 
pipeline. The exception handling routine itself occurs after a one 
cycle latency. The R3000 has a five stage pipeline that consists of 
instruction fetch, instruction decode, ALU operation, cache fetch, 
and the write-back stage. Table 1 shows in the last column the 
number of instructions in the pipeline that need to be flushed on an 
exception. Address error, for example, can have a maximum 
latency of four if it occurs on a memory operation cycle. This is 
because four instructions in the pipeline stage have to be flushed. 
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Error 

Pipeline stage 

Pipeline stages 
to be flushed 

Illegal Instruction 

Instruction Decode 

2 

Address Error 

Memory Operation 

4 

Interrupts 

Instruction Fetch 

3 

Overflow 

ALU Operation 

3 

TLB Miss 

Instruction Decode 

2 


Table 1. Latency In the R3000 on an exception 


Figure 4 shows the pipeline stages in the R3000. The different 
stages are lilnstruction fetch; R:Read Decode; A:ALU operation; 
M:Memory operation; and W:Write-Back. When there is an 
exception on an instruction fetch cycle, the exception routine starts 
executing one clock cycle later as shown. 



INTERRUPTS IN THE IDT79R3000 

The R3000 processor has six general purpose hardware 
interrupts and two software generated interrupts. The hardware 
interrupts are sampled during phase 2 of all run and fixup clock 
cycles. This is shown in Figure 5. tDS is the data setup time, tHLD is 
the data hold time and tSMP is the phase delay between the 
Clk2xSmp input and the Clk2xPhi input. These two clock inputs are 
part of the four phase clock inputs given to the processor and are 


useful for selecting the proper static RAM parts for interface 
considerations. The interrupts are level-sensitive. They continue to 
be sampled during phase 2 of the clock cycle after an interrupt 
exception has occurred. The interrupts are not latched within the 
processor when an interrupt exception occurs. Since the interrupts 
are not sampled during stall cycles, BusErr* can be asserted and 
used for exception processing. This is useful in cases where there is 
a need to abort from a stall mode. 


FIXUP 


PHASE 1 


SysOut* 


RUN 


PHASE 2 


PHASE 1 


PHASE 2 


/“X 


Int* 


x i^c~ 

f- tos“l 

H b tDH 

I *SMP [~ 

Figure 5. Interrupt Timing Diagram 
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Each of the eight interrupts can be individually masked by 
clearing the corresponding bit in the IntMask field of the Status 
Register. All eight of the interrupts can be masked at once by 
clearing the lEc bit in the Status Register. 

INTERRUPT HANDLING 

The R3000 branches to the general exception vector at address 
0x80000080 for the exception. The R3000 sets the Int code in the 
Cause Register’s ExcCode field. The IP field in the Cause register 


shows which of the six hardware interrupts are pending and the SW 
field in the Cause register shows which of the two software 
interrupts are pending. Multiple interrupts can be pending at one 
time. 

When the interrupt occurs, the KUp, I Ep, KUc and I Ec bits of the 
Status register are saved in the KUo, lEo, KUp, lEp bit field in the 
Status register. The current kernel status bit KUc and the interrupt 
bit lEc are cleared. See Figure 6. This masks all the interrupts. 






KUo 

lEo 

KUp 

lEp 

KUc 

lEc 





KUo 

lEo 

KUp 

lEp 




Figure 6. Kernel Status and Interrupt Status Are Saved on Interrupts 


INTERRUPT SERVICING 

In case of an hardware interrupt, the interrupt must be cleared by 
deasserting the interrupt line. This has to be done by alleviating the 
conditions that caused the interrupt. Software interrupts have to be 


cleared by setting the corresponding bits (SW1:0) in the Cause 
register to zero. A flow chart of a general exception routine handler 
is shown in Figure 7. 



Figure 7. Flow Chart for Exception Handling 


An example piece of code is given below in Figure 8. It illustrates 
a simple service routine that the processor branches to on 
detecting an interrupt. The actual interrupt handling code itself will 
depend on the user’s application and, therefore, is not given, (an 
and sn are registers in the R3000.) 

As soon as the branch to the address is taken on an interrupt, the 
exception program counter is saved. Line 2 indicates the reading of 
the cause register to determine the exception (in this case an 


interrupt). The status register is saved to be restored after 
processing the exception. A lookup table contains the addresses of 
all the different exception processing routines. A jump is taken to 
the appropriate exception routine. 


1 . 

mfcO 

aO.EPC 

;save exception PC 

2. 

mfcO 

a3,C0_CAUSE 

;get CAUSE register 

3. 

mfcO 

s0,C0_STATUS 

;save status register 
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4. 

and 

5. 

Iw 

6. 

j 

7. intr: 

and 

8. 

Iw 

9. 

i 


al ,a3,CAUSE_EXC_MASK ;get entry to 
Index into look up table 


a2,causevec(a1) 

a2 

a4,aO,INT_CAUSE 

t1,int_level(a4) 


;register a2 with address 
in look up table 
;jump to service routine 
(line 7) 

;load interrupt level in 
register 

; index into interrupt level 
‘n’ routine. 

;go to interrupt routine for 
level ‘n’ 


tl 


;return from interrupt 
routine 


10. mtcO aO,EPC 

11. mtcO s0,C0_STATUS ;restore status register to 

previous value 

12. rfe ;restore status and 

interrupts prior to 
exception exit 

Figure 8: Interrupt Service Routine Example 

Figure 9 illustrates a simplistic block diagram of an R3000 board 
with the interrupt lines connected to a PAL device. The PAL logic is 
designed to affect the R3000 run-time behavior and it defines the 
state of the interrupt lines during Reset. Accordingly, the R3000 can 
be initialized to work as a big-endian or a little-endian processor, its 
block refill rate can be varied, etc. 



Figure 9: Block Diagram of Interrupt Controller on Reset 


CONCLUSION 

The IDT79R3000 provides both flexible and fast exception 
handling capability. Once an exception occurs, the first instruction 
of the exception handler routine is fetched on the very next clock 
cycle, providing minimal latency. Management of the processor 
and system state is left to the exception handling software, allowing 
the system designer to determine what must be done to respond to 


a given exception and thus minimizing the amount of processor 
overhead required to handle exceptions. Even the prioritization of 
the external interrupts is under software control, providing the 
system designer with maximum flexibility in the target system*. 

‘Note: Chapter 5 of the “MIPS RISC Architecture” Book, available 
from IDT, contains further detail on exception processing of the 
79R3000. 
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INTRODUCTION 

The reduced instruction set computer (RISC), the IDT79R3000, 
has allowed for simplicity in hardware and synergy between 
architecture and compilers. To further increase the throughput of a 
computer system, direct-mapped cache memory is implemented 
on systems using the R3000. The availability of a wide variety of 
high-speed static RAMs from IDT gives the designerthe flexibility of 
selecting the proper part for his application. It is necessary, 

CACHE DESIGN 


however, to know the critical timing parameters governing the 
design of a cache subsystem. This article is divided into three parts. 
The first part shows a general cache system with a description of 
the clock inputs. The second section details the equations used to 
calculate the critical parameters. It is followed by an example of an 
IDT7198 static RAM used as a cache RAM for the R3000. 



Figure 1 . 64KB Instruction/Data Cache Configuration 


A simplistic block diagram implementation of a 64KB separate 
instruction cache and data cache is shown in Figure 1 . The design 
of a cache subsystem such as the one above depends on the four 
input clocks to the R3000 processor. These clock inputs are twice 
the frequency of the output clock i.e„ SysOut. By adjusting the 
timings of these clocks, the designer can accommodate a wide 
variety of static RAMs by properly considering specific parameters 
such as set-up and hold times. The clocks themselves can be 
adjusted using tap settings on a delay line or by using delay logic. 
The clock Inputs are described below. 

1) Clk2xSyS: Determines the position of SysOut with respect to the 
data, tag, and address buses. It is positioned so that devices in 


the cache/bus interface clocked by SysOut meet the set-up and 
hold time requirements. 

2) Clk2xSmp: Is used by the R3000 to capture external data onto 
data bus and control inputs. 

3) Clk2xRd: Is used to delay the enable of data bus drivers. 

4) Clk2xPhi: Is used to determine all R3000 outputs i.e., data, 
address, and tag buses. 

Figure 2 shows the timing relationships between the four clocks. ■ # 
All the timing equations for cache design depend on the phase 
relationship between these clocks. Tsmp is the Clk2xSmp to mm 
Clk2xPhi delay, Trd is the Clk2xRd to C!k2xPhi delay, and Tsys is 
the Clk2xSys to Clk2xPhi delay. 
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Figure 2. Timing Relationships Between the Four 2x Clock Inputs 


In the cache implementation scheme, instruction references can take place in different phases of the same clock cycle. See 
begin their reference during phase 2 and transfer data during the Figures3aand3b.Thisisanimportantfactorto consider in orderto 

following phase 1. Data references begin during phase 1 and prevent contention between instruction and data caches, 
transfer data during phase 2. Thus, data and instruction references 



Figure 3a. Data and Instruction Caches During Phase 1 
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Figure 3b. Data and Instruction Caches During Phase 2 


Specific factors such as access time, set-up time, hold time, 
enable and disable times, and the deration factor are key in 
choosing the proper static RAM and in setting the phase delays in 
the clocks. The next section discusses the timing equations 
needed for selecting a static RAM for cache design. 

Equations Governing the Critical Parameters in 
RAM Selection: 

Figure 4 shows the timing of the relevant signals for a 25MHz 
(40ns) R3000. The numbers represent the equations that are critical 
in determining the selection of the static RAMs. The timing is given 
for the worst case condition i.e., a STORE followed by a LOAD. An 
explanation of the equations is given below. 

Internal Sample to Phi: This timing parameter requirement 
guarantees that the processor internal sample to Phi is met. 
f smp 0 5r>s (1) 

Address Access to Data Sample: This timing parameter 
requirement guarantees that the cache RAMs have sufficient 
access time. This calculation assumes that the address delay 
through the FCT373 is limited by its propagation delay. 

RAM d M 0 t C yc~ f smp ~ AdrLo dr - 373 pq - t os — (?) 

Cache Enable to Sample: This timing parameter requirement 
guarantees that the cache RAMs are enabled soon enough to meet 
the processor’s input set-up specification. 

RAMOSE T cyc/2~ f smp-rd~ Rd d - t os (3) 

Minimum Read Pulse Width: This timing parameter requirement 
guarantees that the read pulse generated by the processor is at 
least as long as the cache RAM output enable time. 

RAM d op Q t cyc!2~ * sys-rd (4) 


Read Write 1-Cache Data Bus Contention: This timing parameter 
requirement guarantees that no contention will occur between the 
instruction cache and the processor on a store. 

RAMhz 0 f sys- Rd d + D en (5) 

Processor Data Set-up to End of Write: This timing parameter 
requirement guarantees that the cache RAMs have adequate data 
set-up time when being written into by the processor. 

RAMsd 0 t cyc/2~ f smp ~ DVal d + Wr d - (6) 

Data Hold from End of Write: This timing parameter requirement 
guarantees that the data hold from end of write specification of the 
cache RAMs is met when either the processor or the read buffer is 
writing to the RAMs. 

RAM hd -RAMlz Bfsmp-rd (7) 

Data Set-Up to SysClk: This timing parameter requirement 
guarantees that the set-up time into an external register is met on a 
processor store. 

SetUp S ys 0 t C y C i 2 - t sys - (DVal d + - Sys d - 240 pq)-- (8) 

Data Hold from SysClk: This guarantees that the hold time 
specification of an external register is met on a processor store. The 
data holds on the bus until a subsequent read drives new data. 
Hold§yg 0 f sys-rcf Sys d - 240 pd+ RAM lz\- Rd d - (9) 

Equations 1 to 9 are sufficient for the purpose of selecting the 
proper RAMs for use as cache memory. To illustrate the point 
further, an IDT RAM device, the IDT7198 (16K x 4), is chosen as an 
example. 



*d: Deration due to additional load. Ins per 25pF. 
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phi phi phi 

STORE (phase 2) FETCH (phase 1) 



Figure 4. IDTR3000 40ns (25MHz) Cycle Timing 


CALCULATION OF TIMING PARAMETERS 

An example of a cache subsystem design using an IDT7 198 i.e., 
a 16 K x 4 static FtAM follows. All the numbers used in the 
calculations have been taken from the IDT Data Book 1 and the 
R3000 Interface Manual. The numbers are presented in Figure 6. 
The deration factor has been taken into account for DIPs. Surface 
mount would decrease the deration factor. 

The following factors have been taken into account for 
calculating the deration factor 2 . 

1) The SSI logic and cache RAM propagation delays are derated 
by Ins per 25pF of additional load. 

2) Cache RAM input capacitance is 5pF. 

3) Cache RAM output capacitance is 7pF. 

4) Trace capacitance is estimated at 2pF per inch. 

5) Data and trace tag buses are 6 inches. 

6) Address buses are 2 x 5 inches. 

7) SysOut loading is 50pF. 

8) Test value of 30pF to be subtracted. 


Deration Calculations: 

Address Capacitance: 12 x 2pF = 24pF; (factors 4,5, and 6) 5 
Devices = 5 x lOpF = 50pF; Test value = -30pF; Total 
capacitance = 45pF; At Ins per 25pF, total deration of address bus 
= 2ns. 

Data Bus Deration: Is approximately the same i.e., 2ns. Read 
control capacitance for IDT7198 will be about 10 inches of trace 
and 8 devices at 7pF each. Therefore, Read control deration = 
(76-30)pF/25pF/ns = 2ns. 

In Figure 5, the circled numbers are the equations previously 
described. The number in parentheses is the allowable worst case 
timing. The adjacent number is the total time taken using the 
IDT7198. The numbers for the IDT7198 with the R3000 running at 
different frequencies and the FCT373A are shown in Table 1 and 
Table 2 respectively. 

The first value for each parameter inTable 1 shows the maximum 
allowable worst case rating and the second value shows the timing 
using the IDT7198 RAM. 
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Parameter 

Load 

(PF) 

Symbol 

R3000 Clock Frequencies | 

Min. 16MHz Max. 
(ns) (ns) 

Min. 20MHz Max. 
(ns) (ns) 

Min. 25MHz Max. 
(ns) (ns) 

Address to Data Valid 

30 

tAA 

31 

25 

19 

29 

25 

19 

Output enable to Data Valid 

30 

toOE 

17 

13 

10 

15 

13 

10 

Output Disable time 


tHZ 

14 

12 

2 8 

12 

10 


Output Enable Time 


tl_Z 

2 

2 


5 

5 


Address SetUp to End of Write 


tAW 

43 

36 

27 

20 

20 

13 ; 

Data Setup to End of Write 


tos 

14 

13 

11 

13 

13 

8 

Write Pulse Width 


tpWE 

55 

47 

37 

20 

20 

13 

Data Hold from End of Write 


tHD 

0 

0 


0 

0 

0 

Address Hold from End of Write 


tHA 

0 

0 

0 

0 

0 



Table t. Cache RAM Parameters. RAM Specifications vs. IDT7198 Specifications. 


Parameter 

Load 

(Units) 

Symbol 

Min. 

(ns) 

Max. 

(ns) 

FCT373A Propagation Delay 

50 

*373 PD 


5.2 

FCT373A Latch Enable Delay 

50 

*373 LE 

2 

8.5 

FCT373A Latch Enable Hold 

50 

*373 Hid 

1.8 


FCT240A Prop Delay 

50 

*240 PD 

1.5 

4.8 


Table 2. Parameters for 


Figure 6 shows a block diagram of tap settings on a delay line for 
the four clock input signals. By varying the phase delay between 
these signals, the designer can select the proper static RAMs for 
cache memories and the operating frequency of the R3000. Table 3 
shows suggested tap settings on the delay line for the R3000 
running at different frequencies. 



Clk2xsys Clk2xSmp Clk2xPhi 

Clk2xRd 


Latches and Buffers 

The designer can use a DDU-7F-20* chip for the delay line. The 
clock is the input to the device and the outputs at various points can 
be chosen with the appropriate phase delays. 

CONCLUSION 

The I DT R3000 RISC processor al lows an efficient cache system 
to be implemented with standard architecture static RAMs. To 
design a cache subsystem, it is essential to know only the critical 
equations mentioned above and their relation to the four input 
clocks. The tap settings provide further control of the cache 
subsystem design for different operating frequencies of the R3000. 

REFERENCES 


Figure 6. Tap Settings for the Clock Inputs 


Parameter 

16.67MHz 

20MHz 

25MHz 

Clk2xSys 

0 

0 

0 

Clk2xRd 

6 

6 

6 

Clk2xSmp 

6 

6 

6 

Clk2xPhi 

16 

14 

12 


1. IDT Data Book, pp 4-74 — 4-83, pp 10-72 — 10-75. 

2. MIPS R3000 Processor Interface Manual, pp 105. 

*d: deration due to additional load. 1 ns per 25 pF. 


‘Available from Data Delay Devices (201) 772-1106 



Table 3. Delay Line Setting Summarization 


9.3 


5 











































































































Integrated Device TechnoJogy Inc 


USING THE IDT79R3000 
IN A MULTIPROCESSOR 
ORGANIZATION 


APPLICATION 

NOTE 

AN-28 


By Roy M. Johnson 


INTRODUCTION 

High performance systems, such as shared memory 
multiprocessor architectures, can be built using IDT79R3000 RISC 
processors. The IDT79R3000 incorporates special features that 
provide support for multiprocessor environments. This 


applications note discusses the features of shared memory 
multiprocessor architectures with local caches, examines the 
critical issue of cache coherency, and demonstrates how the 
features of the IDT79R3000 facilitate its use in a shared memory 
multiprocessor system. 


A SHARED MEMORY MULTIPROCESSOR ORGANIZATION 



Figure 1. Block Diagram of a Shared Memory Multiprocessor System 


SHARED MEMORY MULTIPROCESSOR 
SYSTEMS 

A simplified block diagram of a shared memory multiprocessor 
with local caches is shown in Figure 1. This model of a 
multiprocessor system is defined to be tightly coupled and the N 
processors are connected to M memory modules and P I/O devices 
via an interconnection network. All the processors have a local 
cache memory, share the same global address space and 
communicate via shared memory. The interconnection network 
ensures complete connectivity between the processors and 
memory modules and can be implemented as a simple shared 
bus, multi-stage delta network or a more complex cross-bar switch . 
The global shared address space is assumed to be interleaved 
amongst the memory modules in order to minimize memory 
access conflicts. Note that the need for a interconnection network 
can be obviated by using a multi-port memory [1], Examples of 
commercial machines employing a shared memory 
multiprocessor configuration using the R2000/3000 RISC 
processor include the Titan Graphics Supercomputer from Ardent 
Computers [2] and the 4D-MP Graphics Superworkstation from 
Silicon Graphics [3]. 


CACHE COHERENCY 

The presence of local caches in a shared memory 
multiprocessor system introduces the issue of cache coherency 
that may result in data inconsistencies. This problem arises 
because several copies of the same data may exist in local caches 
of different processors at the same time. If one of the processors 
modifies (writes) the value of its copy of the data, then the other 
processors will have the stale or incorrect copy of the modified data 
in their local caches. This is a potential problem created by 
asynchronous parallel algorithms that do not have explicit 
synchronization. Data inconsistencies may also arise in 
multiprogrammed multiprocessor systems whereby a suspended 
process may migrate to another processor and the most recently 
updated data of the process might still be in the original processor’s 
local cache. When the process is run on the new processor, there is 
a possibility that stale data is used if the local cache was not 
previously flushed. This assumes that the process did run 
previously on this processor. It is clear that if data consistency is to 
be ensured in a multiprocessor system, cache coherency must be 
maintained. 
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CONTROL 


CONTROL 


IDT79R3000 ^ MP IN . y ALI[ ^I . E 
PROCESSOR 1 MP REQUEST 


IDT79R3000 
PROCESSOR 2 


ADDRESS 


DATA 


DATA 


D CACHE 


I CACHE 


D CACHE 


I CACHE 


ADDRESS 


SNOOP 
CACHE & 
CONTROL 
LOGIC 


SNOOP 
CACHE & 
CONTROL 
LOGIC 


BUS CONTROL 
LOGIC 


MEMORY INTERFACE 
READ & WRITE BUFFERS 


MEMORY INTERFACE 
READ & WRITE BUFFERS 


COMMON MEMORY BUS 


ADDRESS, DATA & CONTROL 


I/O DEVICE 


MAIN MEMORY 


Figure 2. Block Diagram of a Dual IDT79R3000 Shared Memory Multiprocessor 


A static approach to maintain cache coherency is to make all 
writeable data that is shared, non-cacheable. This method ensures 
data consistency, but at the price of decreased performance and 
with increased main memory conflicts. A dynamic approach to 
maintain cache coherency is to allow multiple copies of shared 
writeable data to exist and rely on a cache coherence protocol 
between the processors to ensure cache consistency. Several 
cache coherence protocols have been proposed and implemented 
using both hardware [4] and software support [5]. The type of 
protocol used depends primarily on interconnection network and 
the number of processors in the system. 

A DUAL IDT79R3000 SHARED MEMORY 
MULTIPROCESSOR 

A simplified block diagram of a dual IDT79R3000 shared 
memory multiprocessor is shown in Figure 2. A simple shared bus 
configuration was chosen for clarity. The two processors are 


connected to the main memory and an I/O device via a common 
bus. Access to the shared bus is arbitrated by the bus control logic. 
Each processor has an instruction and data cache and 
write-through cache update policy is assumed, i.e. all writes to the 
cache are also immediately transmitted directly to main memory. 
Note that a write-back cache update policy, (writes done only to the 
cache and main memory is updated when the cache line is 
replaced) would generate less memory traffic [10]. This is usually 
implemented when there are more than two processors in the 
system. Read and write buffers are included to provide a 
convenient asynchronous interface to the main memory. The 
snoop cache and control logic is used to implement a dynamic 
cache coherency check mechanism. For clarity, a very simple 
cache coherence protocol is chosen for the dual IDT79R3000 
multiprocessor system and is described in detail below (more 
sophisticated and efficient schemes are described in [4], [5], [6] , 
[7] & [8]). 
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Figure 3. Processor Interface to the Snoop Control Logic 


CACHE COHERENCE PROTOCOL 

Each snoop cache maintains a directory of the current entries in 
the local data cache, (i. e. it contains the tags of all the current 
entries in the local data cache). Its primary function is to monitor the 
external memory bus for an address match. In addition, the snoop 
cache maintains state information for each data cache line. A 
cache (tag) line can be in one of three states: private, shared or 
invalid. Data that is exclusive to the processor is marked private, 
data that is common to the processors is marked shared and data 
that is inconsistent is marked invalid. The snoop cache is updated 
concurrently with the data cache. Whenever processor 1 modifies 
or writes a line that is marked shared in its local cache, its snoop 
control logic signals processor 2 that a write to a shared line has 
occurred. The snoop control logic of processor 2 then interrogates 
its snoop cache to determine whether a copy of the modified data is 
present in the local data cache. If a copy is present, it is invalidated 
using the MP request and MP invalidate signals as shown in the 
Figure 2 and the tag line in the snoop cache of processor 2 is 
marked invalid. The snoop control logic of processor 2 sends an 


acknowledge signal to processor 1 which then proceeds to 
complete its write operation to the shared location, i.e. writes into 
the data cache as well as into the write buffer. It must be noted that 
the data value in the write buffer must be retired to the main memory 
before the write operation can be completed. This prevents 
possible data inconsistencies that may arise by processor 2 trying 
to read that particular main memory location before it is updated. 
This cache coherence protocol is also known as 
cross-interrogation. Note that this protocol is applicable only to 
cache lines that are marked shared, while writes to cache lines 
marked private are performed at the processor speed. In the event 
of simultaneous writes to the same shared cache line by both the 
processors, only one of the processors will successful ly acquire the 
external bus (determined by the bus arbitration logic) to issue a 
cross-interrogation signal to the other processor. The write 
operation of the processor that did not acquire the external bus will 
result in a write miss. Figure 3 shows a typical processor interface to 
the snoop cache and control logic in more detail, and is also 
described below. 
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STALL 


STALL 


STALL 


STALL 
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HIGHZ 


HIGHZ 


HIGHZ CD 


CPCOND(3) (MP REQUEST) 


CPCOND(2) (MP INVALIDATE) 


LATCH ENABLE* 


INVALIDATE 
LATCH ENABLE* 


Figure 4. Cache Invalidation Timing Diagram 


DYNAMIC CACHE COHERENCY CHECK 
MECHANISM 

The signals at the snoop logic - processor interface include the 
MP request, MP invalidate, processor latch enable, invalidate latch 
enable and the invalidate address (address of the cache location to 
be invalidated). The snoop logic receives a cross- interrogation 
signal from the other processor when a write is performed to a 
shared cache line. It then searches its tags for an address match. If 
a match occurs, the address is captured in the invalidate address 
register which is clocked by SysOut*, as shown in the Figure 3. The 
CpCond(3) input (MP request signal) of the IDT79R3000 is then 
asserted, causing the 79R3000 to enter into a MP stall. As there is no 
cache activity on the first cycle of an MP stall, the processor latch 
enable signal is deasserted and the invalidate latch enable is 
asserted in order to present the invalidate address to the data 
cache. After the first stall cycle, the CPU will issue DRd* pulses 


during every phase 2 and DCIk (connected to the transparent 
latches) during every phase 1 , this lasts until the end of the stall or 
until one cycle after the assertion of CpCond(2). This permits the 
snoop logic to read the data cache (Data and Tag values can be 
sampled by the falling edge of SysOut*) in order to determine 
whether an invalidation isto be performed. If the cache location is to 
be invalidated, the MP invalidate signal (connected to the 
CpCond(2) input of the 79R3000) is asserted. Invalidation occurs 
by the assertion of Dwr* during phase 2 of the stall cycle with an 
arbitrary invalid tag and arbitrary data value driven onto theTag and 
Data buses. If CpCond(2) is deasserted while CpCond(3) is still 
asserted, the processorwill return to issuing DRd* pulses to enable 
data cache reads. The cycle after CpCond(3) is deasserted 
contains no cache activity. This cycle is used to re-enable the 
processor’s transparent latch and disable the invalidate 
transparent latch. A detailed timing diagram of a snapshot of the 
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cache invalidation process is shown in Figure 4. This is a modified 
version of the timing diagram shown in [11]. Note that Figure 4 
shows the minimal timing required. CpCond(2) is asserted two 
cycles after CpCond(3) is asserted and before the first Drd*. This 
implies that the data location is i nval idated i nrespecti ve of the val ue 
being read. The symbol “cD” denotes that the cache drives the data 
and tag buses when CpCond(3) is asserted. The symbol ”pD” 
denotes that the processor drives the data and tag buses when 
CpCond(2) is asserted. The snoop control logic, atthis stage, must 
mark the tag line in its snoop cache as invalid and send an 
acknowledge signal to the other processor. This indicates that the 
cache invalidation is complete. If desired, more sophisticated and 
efficient invalidation schemes, such as techniques for block 
invalidation, could be implemented. 

SECONDARY CACHE SCHEME 

The cache-main memory interface described above could be 
made more efficient by using a system of multi-level caches [9], 
[12], to provide additional memory bandwidth. For instance, a 
secondary cache that is four times the size of the first level or 
primary cache could be implemented. The secondary cache is a 
superset of the primary cache and also includes state information 
to maintain cache coherency. The cache update policy is typically 
write-through, from the primary to the secondary cache and 
write-back from the secondary cache to main memory. Since the 
primary cache is always a subset of the secondary cache, 
consistent data is guaranteed. This type of multi-level cache 
organization is implemented in the 4D-MP Graphics 
Superworkstation [3] made by Silicon Graphics. 

CONCLUSION 

Maintaining cache coherency is vital in shared memory 
multiprocessors. The implementation of the cache- main memory 
interface and the cache coherency protocol are critical issues. The 
IDT79R3000 RISC processor provides features that facilitate the 
implementation of cache coherence check mechanisms with 
minimum hardware and is well suited to be used in a shared 
memory multiprocessor environment. 
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INTRODUCTION 

One of the ways RISC processors attempt to achieve single-cy- 
cle execution is by pipelining instruction execution. Instruction flow 
through the pipeline can be impeded, however, due to the latency 
effects of instructions such as loads, stores and branches. As a 
consequence of these latency effects, single-cycle instruction exe- 
cution cannot be maintained without a loss of pipeline efficiency. 
This article uses a simple probabilistic model to measure the effect 
of load, store and branch instructions on pipeline performance in 
the Am29000, MC88100, SPARC and the R3000 RISC proces- 
sors. The analysis shows that the pipeline architecture of the 
R3000 processor is able to minimize the latency effects of loads 
and branches to a greater extent than the other RISC processors 
and thereby maintain higher pipeline efficiency. 


INSTRUCTION PIPELINE 

An instruction pipeline can potentially increase the throughput of 
a processor by overlapping the execution of several different in- 
structions. In other words, at any given time in an instruction pipe- 
line, several instructions can be in different stages of execution. 
Figure 1 shows the stages of a typical instruction pipeline of a RISC 
processor with a load/store architecture (only load and store in- 
structions access memory and all other operations are performed 
on registers). After the initial start up latency to fill the pipeline, an 
instruction theoretically can complete execution on every clock cy- 
cle as shown in Figure 2. 



Figure 1. Stages of a Typical R3000 Instruction Pipeline 


However, because of data dependencies between instructions 
in the pipeline and latency effects of instructions such as loads and 
branches, single-cycle instruction execution cannot be main- 
tained. For instance, in the case of conditional branch instructions, 
if the branch condition is evaluated in the ALU stage of the pipeline 
(as shown in Figure 1 ) at atime t, and if the branch condition is true, 
the target instruction can only be fetched at time t + 1 . The instruc- 
tions fetched at times t - 3, t - 2 and t -1 (assuming sequential in- 
struction fetch continues after the branch instruction is fetched) will 
have to be discarded or flushed from the pipeline. Thus, there are 3 
cycles in which no useful work is done. Similarly, in the case of a 
load instruction, because of finite memory access time, the oper- 
and of the load instruction will not be available for the next instruc- 
tion in the pipeline (typically the instruction after the load uses the 
data of the load instruction). Therefore, the pipeline will have to be 
stalled while the memory request is satisfied. Store instructions 
can similarly stall the pipeline if both instructions and data use the 
same single port memory. To prevent memory access conflicts, 
the instruction after a store cannot be fetched until the store is com- 
pleted. The examples described above illustrate the loss in pipe- 
line efficiency due to the latencies of branch and memory refer- 
ence instructions. 

Techniques to improve pipeline efficiency by reducing the 
branch penalty include the use of branch target buffers [1], dy- 
namic branch prediction strategies and delayed branching 
schemes [2], Latency effects of memory reference instructions can 
be minimized by the use of a Harvard architecture with a split 


cache memory for instruction and data [3] and instruction pre- 
fetching schemes. 

The effect of load, store and branch instructions on pipeline per- 
formance can be approximated using a simple probabilistic model. 
This mathematical model is an extension of the one proposed by 
D. J. Lilja [4] for measuring the effect of branch penalty in pipelined 
processors. The model is described below and is used to compare 
the pipeline performance of the Am29000, MC881 00, SPARC and 
the R3000 RISC processors. 
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PIPELINE PERFORMANCE MODEL 

Consider a pipelined RISC processor in which all instructions ex- 
cept memory reference instructions and branches can be exe- 
cuted on every clock cycle, i.e. assume single cycle execution for 
all instructions except loads, stores and branches. Then the aver- 
age number of cycles per instruction (CPIave ) is given by: 

CPIave = Pb(1 + b) + Pm(1 + m) + (1 - Pb - Pm)(1 ) (1 ) 

CPIave= average number of cycles per instruction 
Pb = probability that an instruction is a branch 
b x branch penalty (number of cycles wasted in the pipeline 
if an instruction is a branch) 

Pm = probability that an instruction is a memory reference 
(load or store) 

m = memory reference penalty (number of cycles wasted in 
the pipeline if an instruction is a memory reference, i.e. 
the memory reference latency ) 

Pipeline efficiency can then be computed as the ratio of the mini- 
mum number of cycles per instruction (maximum performance 
case, which is 1 cycle for this model) to the average number of in- 
structions per cycle: 

Pipelineeff = 1 (2) 

CPI ave 

where Pipelineeff = the pipeline efficiency 

Note that if b = 0 and m = 0, i.e. there are no branch or memory 
reference penalties, then 

CPIave » 1 from Equation (1) 
and Pipelineeff = 1 from Equation (2) 

In other words, when the average number of cycles per instruc- 
tion is 1, then the pipeline operates at maximum efficiency. 

The key assumptions of the pipeline model represented by 
Equation 1 are: 

1 ) The execution of all branch and memory reference instruc- 
tions will result in a pipeline penalty. This may not be true in all 
RISC architectures especially with reference to conditional branch 
instructions where the pipeline penalty will depend on whether or 
not the branch is taken. Similarly, the memory reference penalty 
will depend on whether the instruction is a load or a store. 

2) The model also assumes that a new instruction can be 
fetched on every clock cycle, i.e. a 100% instruction cache hit rate 
which is the ideal case. 

Equation 1 should be modified to accurately model a target archi- 
tecture. The RISC architectures in this comparison (the Am29000, 
SPARC, MC88100 and R3000) use a delayed branching scheme. 
A processor with a branch delay of b cycles will always execute b 
instructions after a branch instruction, whether the branch is taken 
or not. The compiler (for a processor using delayed branches) de- 
termines the instruction dependencies and reorganizes the in- 
struction stream to fill the branch delay slots with useful instruc- 
tions. In many cases some or all of the b delay slots after the 
branch must be filled with no-ops. Gross and Hennessy [5] devel- 
oped an algorithm for optimizing delayed branches in the MIPS 
project and have shown that the first branch delay slot can be filled 
with a useful instruction more than half the time while subsequent 
delay slots are increasingly harder to fill. Equation 1 can be modi- 
fied for a RISC architecture with a delayed branching scheme with 
b branch delay slots as follows: 


CPIave = Pb(1 +bnop) + Pm (1 + m) + (1 -Pb-Pm)(1) (3) 


where, b = number of delay slots after a branch instruction the 
effective number of cycles/branch x 1 + bPnop and the effective 
number of cycles/memory reference = 1 + m 
if fi is the probability that a delay slot i is filled with a useful instruc- 
tion, then 


Pnop = 1 


fi + f2 + + fb 

b 


(4) 


Thus, Pnop shows the fraction of delay slots that do no useful 
work and thereby add to the branch penalty. 

Equation 3 can be further reduced to: 


CPIave = 1 + bPbPnop+ mPm 


In other words the average number of cycles per instruction is 1 
plus a fraction due to the branch penalty plus a fraction due to the 
memory reference penalty. The pipeline model represented by 
Equation 3 will be used in the architectural comparisons made in 
the subsequent section. 

INSTRUCTION MIX 

The format of the instruction mix used in the performance com- 
parisons is shown in Table 1. This instruction mix is typically repre- 
sentative of non numeric code (no floating point operations in- 
cluded) and is obtained from a suite of integer benchmarks per- 
formed by MIPS Computer Systems. It is important to note that 
even though an instruction mix is program dependent, a fairly sig- 
nificant portion of the total number of instructions usually consists 
of load, store and branch instructions. In the analysis it is assumed 
that conditional branch instructions are taken 65% of the time [4] 
and the instruction immediately following a load always uses the 
data of the load instruction. 


INSTRUCTION TYPE 

RELATIVE FREQUENCY 

ALU operations 

.55 

Branch instructions 

.15 

Load Instructions 

.20 

Store Instructions 

.10 

TOTAL 

1.0 


Table 1. Instruction Mix Format 


PERFORMANCE COMPARISONS 

This section uses the pipeline model represented by Equation 3 
and the instruction mix described in Table 1, to compare the the 
pipeline efficiencies of the Am29000, SPARC, MC88100 and the 
R3000 RISC architectures. It is assumed that the branch delay slot 
can be filled with a useful instruction 50% of the time in all cases [5]. 

Am29000 

The Am29000 RISC processor has a 4 stage instruction pipeline, 
provides an interface to separate instruction and data caches and 
also includes an on-chip branch target cache [7]. The processor 
also includes a 4 deep instruction pre-fetch buffer. All branch in- 
structions execute with a single branch-delay slot. If the branch is 
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taken and the target address is found in the branch target cache, 
there is no extra branch penalty (except for the single delay slot). 
However, in case of a branch target cache miss, there is a penalty 
of at least 2 cycles and the probability of a branch target buffer miss 
will be assumed to be .25 [4]. Load instructions execute with a la- 
tency of at least 1 cycle and a store instruction will cause a pipeline 
hold mode of at least one cycle if overlapped with a load. Equation 
3 can be modified to to model the Am29000 and reduces to: 

CPIave = 1 + bPbPnop + PbPt Pm C + 1 Pi + SPs (6) 

CPIave = average number of cycles per instruction 
Pb = probability that an instruction is a branch (.15) 

b = number of branch delay slots (1) 

Pnop = probability that an instruction in a branch delay slot is a 
no-op (.5) 

Pt = probability that branch is taken (.65) 

Pm = probability of a miss in the branch target buffer (.25) 

c = number of cycles wasted due to a branch target buffer 

miss (2) 

Pi = probability that an instruction is a load (.2) 

Ps = probability that an instruction is a store (.1) 

I = load latency (1) 

s = store latency (1 ) 

effective cycles/load = 1+1 =2 

effective cycles/store = 1 + s =2 

effective cycles/branch = 1 + bPnop + Pt Ptm c = 1 .825 


branching scheme is implemented with a branch delay slot of 1 cy- 
cle. The target instruction of a branch is always fetched regardless 
of whether the branch is taken or not. In the case of a conditional 
branch that fails, there is an additional penalty to flush the pipeline 
and fetch the correct instruction sequence [6], Loads take 2 cycles; 
the address of the operand is computed in the ALU stage (stage 3) 
of the pipeline and the cache access is made in stage 4. A new in- 
struction cannot be fetched during stage 4 of a load instruction as 
both instruction and data share the same memory and therefore a 
load effectively takes 2 cycles. For similar reasons, store instruc- 
tions take 3 cycles . If it is assumed that there is a branch penalty of 
at least 2 cycles for a conditional branch that fails (i.e. to flush the 
first two stages of the pipeline), then equation 3 then reduces to: 

CPIava = 1 + Pb bPnop + (1 — Pt) 2) + IPl + SPs (7) 

Pb = probability that an instruction is a branch (.15) 

b = number of branch delay slots (1) 

Pnop = probability that an instruction in a branch delay slot is a 
no-op (.5) 

Pt = probability that branch is taken (.65) 

Pi = probability that an instruction is a load (.2) 

Ps = probability that an instruction is a store (.1 ) 

I = load latency (1) 

s = store latency (2) 

effective cycles/load =1+1=2 

effective cycles/store = 1 + s =3 

effective cycles/branch = 1 + bPnop + (1 - Pt )2t = 2.2 


The numbers in parentheses are the actual values assigned to 
each of the factors. Using these values in Equation 6, the average 
number of cycles per instruction is computed to be: 

CPIave =1.424 cycles 
and the pipeline efficiency is computed to be 
Pipelineeff = 1 /1 .424 = .702 
from Equation 2 

Therefore, for the instruction mix described in Table 1 , the pipe- 
line operates at an efficiency of 70.2% . 

SPARC 

The SPARC processor has a 4 stage instruction pipeline, in- 
cludes a large register file (136 general purpose registers) [8] and 
uses a register windowing scheme for parameter passing. It can be 
argued that in the case of a RISC architecture with a register win- 
dowing scheme, there will be relatively fewer loads and stores for a 
given instruction mix. The code generated, however, depends on 
the quality of the compiler. Morrison and Walker [13] have shown 
that for a given instruction mix the compiler for the SPARC archi- 
tecture (the only architecture in this analysis that actually uses a 
register windowing scheme) reduced the the percentage of loads 
and stores by 3-9% compared to the MIPS compiler. The MIPS op- 
timizing compiler, however, generated fewer total instructions and 
sometimes up to 1/3 fewer instructions compared to the SPARC 
compiler. The MIPS compiler therefore, generated fewer total 
number of loads and stores. The instruction mix in Table 1 is used 
for comparing the performance of the SPARC with the R3000. The 
SPARC processor has a single address and data bus and uses a 
single cache for both instructions and data. The architecture in- 
cludes 2 deep instruction pre-fetch buffer and uses a value in the 
condition code register for conditional branching. A delayed 


Using these values in Equation 7, the average number of cycles 
per instruction is computed to be: 

CPIave = 1 .58 cycles 

and the pipeline efficiency is computed to be 
Pipelineeff = 1 /1.58 = .633 

from Equation 2 

Therefore, for the instruction mix described in Table 1 , the pipe- 
line operates at an efficiency of 63.3% . 


MC88100 


The Motorola MC88100 RISC processor includes 32 general 
purpose registers and uses a two-port, non multiplexed memory 
access scheme (Harvard architecture). The processor contains a 2 
stage instruction unit pipeline that fetches and supplies instruc- 
tions to the integer or floating-point unit [9]. Data memory accesses 
are pipelined and controlled by the 3 stage data unit. Because the 
integer unit consists of a single stage, there are effectively 6 pipe- 
line stages for an integer ALU operation performed on operands in 
memory. A delayed branching scheme is employed with a delay 
slot of 1 cycle. A separate adder for address calculations is also in- 
cluded so that branch target addresses are computed in parallel 
with instruction decoding. The branch penalty therefore is only 1 
cycle (the single delay slot). The load latency is 2 cycles because 
the memory management unit is off-chip and additional time is 
needed for address translation; stores on the other hand have no 
latency and execute in a single cycle. Because loads and stores 
are pipelined, there are possible memory conflicts that will add to 
the memory reference penalty. It is assumed that the compiler can 
reorganize the instruction stream so that the first load delay slot 
(first load latency cycle) can be filled with a useful instruction 70% 
of the time and the second delay slot can be successfully filled 30% 
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of the time. Therefore, the probability that the instructions in the 
load delay slots are no-op's can be computed using Equation 4. 
Equation 3 then reduces to: 

CPIave = 1 + bPbPnopb -4-1 Pi Pnopl (8) 

Pb = probability that an instruction is a branch (.15) 

b = number of branch delay slots (1 ) 

Pnopb = probability that an instruction in a branch delay slot is a 
no-op (.5) 

Pi - probability that an instruction is a load (.2) 

I = number of load delay slots (2) 

Pnopl = probability that the instructions in the load delay slots 
are no-ops (.5) 

effective cycles/load = 1 + IPnopi = 2 

effective cycles/store = 1 

effective cycles/branch * 1 + bPnopb = 1 .5 

Using these values in Equation 8, the average number of cycles 
per instruction is computed to be: 

CPIave =1.275 cycles 
and the pipeline efficiency is computed to be 
Pipelineeff = 1 /1.275 = .784 
from Equation 2 

Therefore for the instruction mix described in Table 1 , the pipe- 
line operates at an efficiency of 78.4% . 

R3000 

The R3000 RISC processor [10] has a 5-stage instruction pipe- 
line and includes 32 general purpose registers. The processor has 
both an address and a data bus which are cycled at twice the proc- 
essors clock frequency. This enables the processor to have sepa- 
rate instruction and data caches. The instruction cache is ac- 
cessed during one phase of the processor cycle, while the data 
cache is accessed on the other phase [11], All branch instructions 
execute with a delay slot of 1 cycle. The processor also includes a 
separate adderfor address computations which enables the target 
address of a branch instruction to compute in parallel with the in- 
struction decode phase of the pipeline [12]. The processor em- 
ploys a fast compare scheme for conditional branches (includes a 
separate comparator for equality, inequality and any relation with 
zero tests). Branch conditions, therefore, can be evaluated early in 
the pipeline and the ALU stage is not required for most conditional 
branches [2]. Thus, the branch penalty is 1 cycle (branch delay 
slot) that the compiler successfully fills 50% of the time. Load in- 
structions have a latency of 1 cycle (the address of the load is com- 
puted in stage 3, and the memory access is completed by the end 


of stage 4) and it is assumed that the compiler fills the load delay 
slot with a useful instruction 70% of the time. Store instructions 
have no latency and execute in a single cycle. Using these pa- 
rameters Equation 3 reduces to: 

CPIave = 1 + bPb Pnopb +1 Pi Pnopl (9) 

Pb = probability that an instruction is a branch (.15) 

b = number of branch delay slots (1) 

Pnopb = probability that an instruction in a branch delay slot is a 
no-op (.5) 

Pi = probability that an instruction is a load (.2) 

I = number of load delay slots (1) 

Pnopl = probability that the instruction in the load delay slots 
is a no-op (.3) 

effective cycles/load = 1 + IPnopi =1.3 

effective cycles/store = 1 

effective cycles/branch = 1+ bPnopb =1.5 

Using these values in Equation 9, the average number of cycles 
per instruction is computed to be: 

CPIave = 1.135 cycles 

and the pipeline efficiency is computed to be 
Pipelineeff = 1 /1.135 = .881 
from Equation 2 

Therefore, for the instruction mix described in Table 1 , the pipe- 
line operates at an efficiency of 88.1% . 

CONCLUSION 

The performance comparisons made above are summarized in 
Table 2 shown below. The effective number of cycles for branch 
and memory operations are also included. It is important to note 
that the CPIave and the Pipelineeff results shown in Table 2 pertain 
to the instruction mix described in Table 1 . Any instruction mix that 
has a greater frequency of loads, stores or branches will increase 
the average number of instructions per cycle and thereby de- 
crease the pipeline efficiency. The results show the detrimental ef- 
fect of load, store and branch latencies on pipeline efficiency. The 
analysis shows that the R3000 maintains the highest pipeline effi- 
ciency for the given instruction mix. This is because the pipeline ar- 
chitecture of the R3000 is optimally designed to minimize the la- 
tency effects of loads and branches. The branch latency in the 
R3000 is minimized by using a fast single cycle compare and 
branch and having the branch target address computed by the 
second stage of the pipeline. The load latency is minimized by hav- 
ing the cache control and tag compare logic on chip and accessing 
the instruction and data caches on alternate phases of the clock. 



Table 2. Performance Comparison Summary 
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The effect of instruction cache misses has not been taken into 
account in this analysis and it is critical that the cache miss penalty 
is minimized. In conclusion, the analysis shows that the latency ef- 
fects of memory reference and branch instructions are critical to 
pipeline performance. The R3000 processor is able to minimize 
these latencies to a greater extent than the Am29000, SPARC and 
MC88100 processors and thus obtain better pipeline efficiency. 

REFERENCES 

[1] J. K. F. Lee and A. J. Smith, “Branch Prediction 
Strategies and Branch Target Buffer Design", IEEE 
Computer, Jan. 1984, pp 6- 22. 

[ 2] S. McFarling and J. Hennessy, "Reducing the Cost of 
Branches”, Proc. 13th Annual Symp. on Computer 
Architecture, 1986, pp 396 - 403. 

[ 3] A. J. Smith, “Cache Memories”, ACM Computing 
Surveys, Vol. 14, No. 3, Sept. 1982, pp 474 - 530 
[ 4] D. J. Lilja, “Reducing the Branch Penalty in Pipelined 
Processors”, IEEE Computer, July 1988, pp 47 - 55. 

[ 5] T. R. Gross and J. L. Hennessy, “Optimizing Delayed 
Branches”, Proc. 15th annual Workshop on 
Microprogramming, Oct. 1982, pp 114-120. 

1 6] F. Baskett, “SGI’s 4D MIPS processor versus the 
Sun - 4 SPARC processor”, correspondence via 
electronic mail on the UNIX network to the comp.arch 
newsgroup, Dec 10, 1987. 


[ 7] Am29000 Streamlined Instruction Processor User 
Manual, Advanced Micro Devices Inc., 1988, pp4-l 
to 4-16. 

[ 8] SPARC MB86901 High Performance 32-bit RISC 
Processor, Product Description, Fujitsu 
Microelectronics Inc., Sept. 1988, pp 2-30. 

[ 9] MC88100 RISC Microprocessor User’s Manual, 
Motorola Inc., 1988. 

[10] G. Kane, MIPS RISC Architecture, Prentice-Hall Inc., 

1987, pp 1-7 to 2-15. 

[11] T. Riordan and P. Ries, MIPSR3000 Processor 
Interface, MIPS Computer Systems Inc., Oct 27 

1988, pp4-7. 

[12] T. Riordan et al, “The MIPS M2000 System”, Proc. 
International Conference on Computer Design 
(ICCD), Oct. 1988, pp 366 - 369. 

[13] S. Morrison and N. Walker, “Register Windows Vs. 
General Registers: A Comparison of Memory Access 
Patterns”, Technical Note, University of California, 
Berkeley, California, May 13, 1988. 


9.5 


5 




Integrated Device Technology, Inc. 


DESIGN TECHNIQUES 

FOR LOWERING POWER 
CONSUMPTION IN RISC DESIGNS 

APPLICATION 

NOTE 

AN-55 





by Barry Seldnor, FAE 


Most devices have two variables that relate to power 
consumption — static or standby power and dynamic or 
operational power. Dynamic power is consumed at a rate that 
is almost linear with speed. A CMOS rule is that the faster a 
device toggles, the more power is consumed. This exemplifies 
the typical CMOS characteristic where power is consumed 
during transition time. Operation, therefore, results in power 
consumption added to the static-power base line. 

Applications using RISC processors are performance- 
oriented; performance being the primary impetus in using 
RISC technology. Because speed equals power and RISC 
machines are high-speed, how can we reduce power without 
sacrificing performance? 

Let’s examine the memory system. A RISC memory 
system consists of two components — the main memory and 
the cache. The main memory, DRAM or SRAM, is only 
accessed when a cache miss occurs. If main memory is 
SRAM, it can be deselected effectively by putting the SRAM 
in powerdown mode. If the main memory is DRAM, it consumes 
little power outside of refresh and access. We can reduce the 
instantaneous current demands by using different refresh 
methods. 

During refresh, a large current spike is needed to revitalize 
the charge in the storage capacitors. The fewer DRAMs you 
refresh at a time, the lower instantaneous current the system 
will consume. To lower power consumption, the designer can 
distribute the refresh cycles over time and over banks of 
DRAM memory. It is important to note that, although this 
increases the complexity, it rarely impacts performance. Many 
DRAM controllers on the market today use this method of 
reducing DRAM power. This design philosophy should also 
reduce the noise a system produces. 

The cache memory subsystem is constantly accessed 
and, therefore, power reduction here is more complex. To 
better understand the power reduction opportunities, a more 
detailed understanding of this subsystem is needed. 

As in all RISC engines the R3000 implements a local 
cache. The Harvard architecture cache structure of the R3000 
is divided into two components; the instruction cache and the 
data cache. Both caches are physically mapped so that they 
can include their respective data, tag, valid bit, and parity 
equaling 60 bits per line (a line is a cache entry that has a 
unique tag). 

During a single R3000 cycle, both the instruction and data 
cache are accessed. If the RISC engine is running at 33MHz, 
the caches run at 264 MBytes/sec. This cache scheme 
requires that the cache be continuously selected, therefore 
always powered up. The only way power can be saved in the 
cache subsystem is by deselecting the caches during a cache 
miss. 


A model will help us understand how much power can be 
saved. Assume an average program contains the following 
characteristics: 55% ALU instructions, 15% branch instruc- 
tions, 20% load instructions, 10% store instructions, 95% I- 
cache hit rate, 85% D-cache hit rate, cache missing read/write 
latency = 5 cycles. If we were to use a program that has 100 
instructions we could compute: 

55 ALU instr. @ 1 cyc./instr. = 55.0 eye. 

10 store instr. @ 1 cyc./instr. = 10.0 eye. 

15 branch instr. @ 1.5* cyc./instr. = 22.5 eye. 

20 load instr. @ 1 .3* cvc. /instr. = 26.0 cvc . 

Total = 113.5 eye. 

*Since the R3000 has branch and load delay slots, we are 
going to conservatively estimate that the compiler can fill 
branch delay slots 50 percent of the time with useful instructions 
and fill load delay slots 70 percent of the time with useful 
instructions. Unfilled delay slots have nops inserted which do 
not count as useful instructions. 

This means that a 100 instruction program takes 113.5 
instructions when we take into account the load and branch 
delay slots. This results in a cycle instruction rate of 1 1 3.5/1 00 
= 1 .35 cycles per instruction. The cache miss rate is then 
added to the above rate. 

Instruction Cache Performance @ 95% Hit Ratio 

95 instructions hit in cache *1.135 cycles/instruction + 5 

Missed instructions *5 cycles latency *1.135 

cycles/instruction = 136.2 cycles. 

Data Cache Performance @ 85% Hit Ratio 

20 total data loads which miss at 15% rate 

20 * .15 = 3 load instructions miss 

3 loads @ 5 latency cycles = 15 cycles 

As shown, our original 100 instruction program, which 
took 1 13.5 cycles without cache hit criteria, now takes a total 
of 1 36.2 + 1 5 = 1 51 .2 cycles. Since 5 instructions missed and 
3 loads missed, 40 out of 151 were cache idle cycles or 
approximately 25%. Deselecting the cache during this time 
will produce the power consumed by the cache memory 
system. 

When implemented this scheme, keep in mind that when 
CS = False the SRAM goes into a standby node. One could 
assume that the power savings could be 87% for 25% of the 
time. With any method used to generate the percentages, the 
savings will be measurable. 
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It is very easy to decode real stalls and deselect the cache 
— reference Figures 1 and 2 Single Word Transfer (included). 
At the beginning of a stall, the R3000 output MemRd goes low 
to signal a cache miss and request a main memory read cycle. 
Since the RdBusy input is normally high, the boolean equation 
sets Cache CS high, deselecting the caches. Since RdBusy 
is sampled at the end of phase one, a complete cyc le exists 
before the caches are needed. The R3000 then sets MemRd 
high at the beginning of the fix-up cycle. After the fix-up cycle 
is complete, the main memory controller returns the RdBusy 
high again in preparation for the next miss cycle. A simple 
boolean equation can be written to provide the deselection 
function: 


Cache-CS = MemRd * RdBusy 

Write stalls are just as easy to decode. Note that write 
stalls only occur when the write buffer is full — s.e., data 
cannot be written into main memory at the processor rate. 
Therefore, because the model does not include write buffer 
stall cycles, their help will be difficult to demonstrate. Simula- 
tion history shows that these cycles are a small amount of the 
the total stall cycles, but we show how to decode them and 
deselect the cache since it takes very little logic to do so. 


MemWr goes low to indicate when a main memory write 
mu st be pe rformed. Conventional write buffer operation will 
set WrBusy low after it is full in orderto stall the R3000 on the 
next main memory write cycle. The R3000 will stall if another 
Store cycle occurs while the writ e buffer i s full and WrBusy is 
still low. Note that during the stall, Me mWrgoe s false so it must 
be captured with a register clock on SysOut. When the data 
can be accepted, WrBusy can be set high to terminate the 
stall. One cycle later, a fix-up cycle will occur where the data 
can b e captur ed in phase two. With this in mind, Cache-CS is 
NOT WrBusy. 

SUMMARY 

The R3000 has features that can minimize the power 
consumed. The model program created does not take into 
account instruction streaming or block refill. These features 
will improve performance and cache utilization which is in- 
versely proportional to idle cache time. Because performance 
is paramount and these features require added hardware, 
tradeoffs become more difficult. In the final analysis, cache 
idle time is a percentage of operating time — that percentage 
being application-specific. It is up to the designer to trade off 
the additional gates with powersavingswhichcanbe significant. 
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Figure 1. Single Word Instruction Transfer — Beginning of Stall 
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Figure 2. Single Word Instruction Transfer — End of Stall 
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by Philip Bourekas 


The R3000 includes parity over the data and tag portions 
of both the instruction and data caches. While this gives 
the system reliability people a “good” feeling about the re- 
liability of the design, a detailed analysis of the cost versus 
the benefits of parity protection may lead the system archi- 
tects to a different decision. 


The IDT R3001 RISController™ removes the restriction 
that parity must be used, and allows the system designer 
full flexibility to make the cost/benefits trade-offs. This ap- 
plication note provides information to help this analysis. 


59 58 55 54 36 35 32 31 0 


Valid 

Tag 

Parity 

Tags 

Data 

Parity 

Data 


IDT79R3000 Cache Entry 


Notes for IDT79R3000: 

All 60 bits must be input on every 
cache access. 



1 

31 


0 


Valid 

Tags | Data [ 

(0-19 bits) i Parity i 

1 1 

Data 





IDT79R3001 Cache Entry 



Notes for IDT79R3001: 

Data Parity is optional. 

The number of tag bits is determined 
at Reset time. 




Figure 1. Cache Line Formats of R3000 and R3001 


THE COST OF PARITY 

Parity in the system caches is costly and occurs in a 
number of areas: 

Static RAM Cost 

Parity consumes 7 bits from each of the instruction and 
data caches, for a total of 14 bits of cache. In most work- 
station applications parity adds to the system cost a mini- 
mum of 4 high-speed 16K x 4 static RAM devices. 

Figure 1 illustrates the different cache formats of the 
R3000 and the R3001. 

Read/Write Buffer Interface Logic 

Parity causes the system bus to be widened by 4 bits, 
thereby requiring extra devices in the read/write buffer path 
to accommodate parity. Additionally, typical designs do not 
carry parity in main memory: computer systems use EDC 
(Error Detection and Correction); embedded systems do 
not choose to incurr the overhead, cost and complexity of 
parity in main memory and, thus, do not include either par- 
ity or EDC. The requirement of parity in the caches also 
means that these systems must provide for parity genera- 
tion logic using, as an example, four IDT74F280 parity 
generators. 
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The shaded portions of Figure 2 illustrate the added over- 
head of parity in R3000 systems. 

Memory Speed 

In systems which omit parity from main memory, there is 
a performance impact for requiring cache parity. Block refill 
of the R3000 relies on data coming into the cache at the 
rate of one word per CPU clock cycle. With a parity gen- 
erator in this path, the system designer must either use 
faster memory to allow time for the parity generators (mem- 
ory must be fast enough to both provide a new word and 
calculate its parity at the rate of the CPU clock cycle), or 
the designer must pipeline memory. This requires adding 
extra logic and an extra cycle of latency to main memory so 
that the parity generation logic has a full clock cycle to 
produce results. Thus, the system designer is faced with 
two unattractive alternatives: purchase faster (more expen- 
sive) main memory devices than would otherwise be re- 
quired, or degrade performance and purchase extra logic to 
pipeline main memory to give the parity generation logic 
ample time. This is illustrated as the pipeline stage in 
Figure 2. 
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Cache Design Complexity vided to the processor on every cache access; this signifi- 

Many embedded applications utilize techniques (dis- cantly complicates the design of these reduced systems 
cussed in other IDT application notes) which reduce the since parity must be stored over tag bits which may not 
number of tag bits that must be stored in the cache memo- actually reside in memory. 



Figure 2. Impact of Parity on Main Memory and Interface 


THE BENEFITS OF CACHE PARITY external phenomenon, typically alpha particles. When one 

Now that the full cost of cache parity is understood, it is these events occurs the parity over the value which was 
important to recognize the protection gained by parity. changed would indicate that something was wrong and it is 

Cache parity is not intended to uncover timing or electri- left to the system to decide how to deal with the event. In 
cal faults in the design. It is also typically not intended to ^e case o{ ^e R3000, it is dealt with by processing a 
uncover hard failure devices. Instead, parity is designed to cache miss. In the R3001, when data parity is enabled, a 
detect the occurrence of a "soft failure”, whereby a “good” cache miss is processed as in the standar d R30 00 and, 
RAM has the value in one of its cells flipped by some additionally, the parity error is reported on the PErr pin. 
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In the R3000 there are two separate areas of each cache 
which are protected by parity. These are the tag and the 
data areas. Parity actually benefits the data portion of the 
system much more than it does the tag. If a tag bit flips 
due to a soft error, then to have an adverse effect, it must 
flip from a value that would have caused a cache miss to a 
value that allows a cache hit. Otherwise, a cache miss 
would be processed in any case. The incremental system 
reliability of TAG parity is very minimal since the statistical 
likelihood of a soft error forcing the incorrect instruction to 
be processed is minimal. 

Parity over the data portion of the caches performs a 
more important function. If a soft error were to occur, the 
consequences would be more serious: an erroneous data 
value may be loaded, an erroneous instruction may be exe- 
cuted or an illegal instruction exception may occur. Parity 
over this 32-bit field does have a finite, measurable benefit. 

The system designer must understand just how big is 
that benefit and weigh it against the system cost of main- 
taining parity. If the likelihood of a cache parity error is 
statistically insignificant relative to the probability of a disk 
failure or some other device failure, then the cost of parity 
far outweighs any benefit. 

The industry standard measure of the probability of a 
soft error is the FIT rate of the RAM devices. One FIT is 
equivalent to a mean time-to-failure of 10 9 device hours of 
operation. The FIT rate is measured by accelerating the 
“typical” life of the product; typically, soft errors are acceler- 
ated by exposing the die to Thorium-232. 

IDT static RAMs exhibit FIT rates of less than 10 (poly- 
imide die coating, used on most IDT SRAMs, lowers the 
FIT rate to effectively 0). This results in a mean time-to- 
failure of a single RAM device longer than 100 million de- 
vice hours. Of course, the more RAMs you use, the “less 
reliable” the system and the shorter the mean time-to-fail- 
ure. Typically, as the number of devices increases, an 
RMS analysis is used to determine the decrease in reliabil- 


ity This analysis will use the more severe assumption that n 
devices cause the reliability to degrade to 1/n of what it is 
with one device. 

If each 32-bit cache is built from eight 4-bit wide static 
RAMs, then there are 16 devices (this is the worst case; 
wider RAM devices have equivalent FIT rates, but fewer 
devices are required to build the caches). For mathemati- 
cal simplicity, let us use a FIT rate of 6.67 for each RAM 
(this FIT rate is significantly higher than that measured for 
the IDT 16K x 4 high-speed SRAM). Thus, 16 devices 
would have (16 x 6.67) failures in 10 9 hours, or a mean 
time-to-failure of 10 7 hours. 

Ten million hours is over 400,000 days or over 1,100 
years. Thus, providing parity for the R3000 cache is de- 
signed to safeguard for an event which is not likely to occur 
until after the year 3090. 

WHY DOES ANYONE USE PARITY? 

Given that parity appears extremely costly relative to the 
likelihood of the problem it solves, why does anyone go to 
the trouble of using it? This question has a number of 
answers. 

First, not all memory device architectures have such high 
resistance to soft errors. DRAM devices typically involve 
smaller, single transistor cells with no active pull-up device 
to lock a value into the data cell, while SRAM devices re- 
quire 4 to 6 transistors per memory bit and actively pull-up 
logic high values in the cell (Figure 3). DRAM devices 
require that each memory cell be periodically refreshed in 
order to maintain its stored value, while SRAMs have no 
such requirement. Thus, DRAMs are considerably more 
prone to soft errors than SRAMs, and parity protection is 
sometimes employed in these systems. Studies indicate 
that main memory, composed of extensive banks of DRAM, 
can have a probability of soft error in as short a period as 
20 days. Therefore, more designs are moving to Error De- 
tection and Correction to protect main memory. 




Typical DRAM Memory Cell Structure 
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Figure 3. DRAM Memory Cell Structure and SRAM Memory Cell Structure 
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Even within systems that use static RAM devices, parity 
may still be an acceptable cost. Space applications would 
tend to expose memory to a higher incidence of 
alpha-particles and, thus, have a higher likelihood of soft 
errors. Certain commercial applications prefer to support 
parity despite its cost. Banking applications, such as on- 
line transaction processing systems for ATM networks, could 
involve a significantly higher cost if a data error was al- 
lowed to propagate through the system. Other systems 
promote system parity as a “marketing” feature, promoting 
its “benefits" as added value to the customer. 

PARITY ON THE R3001 

The R3001 treats parity differently than the standard 
R3000. First of all, TAG parity is not supported at all. As 
shown above, the probability of a system problem due to a 
soft error on a RAM containing TAGs is extremely small. 
Since the R3001 supports variable TAG bus width, imple- 
menting TAG parity over this variable width field would have 
significantly complicated the control logic and slowed over- 
all device operation. 


On the data portions of each cache, the system designer 
has the ability to decide whether or not a given system 
supports data parity. If the application desires parity, the 
system designer can make this design decision. In this 
application, the system would still benefit from features of 
the R3001 such as its ability to reduce the width of the TAG 
bus (and thus reduce system cost) Another benefit from the 
R3001 is signaling to the system that a parity error has 
occurred and therefore allowing the system to distinguish 
this event from a normal cache miss cycle. 

SUMMARY 

The R3001 RISController™ gives the system designer to- 
tal flexibility over the cost/benefits trade-offs of the applica- 
tion. For the majority of systems and embedded applica- 
tions, parity can be omitted and significant cost savings can 
be achieved, For systems which still require parity, the 
R3001 allows the system designer to maintain parity while 
still adding features and benefits not found in the standard 
R3000. The result is a device well suited to a variety of 
applications, and software compatible (both at the kernel 
and user level) with the popular R3000. 
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INTRODUCTION 

The reduced instruction set computer, IDT79R3000, has 
allowed for simplicity in hardware and a synergistic relationship 
between the architecture and compilers. To increase the 
throughput of an IDT79R3000-based system, direct mapped 
cache memory is implemented. The availability of high-speed 
static RAMs from IDT gives the designer the flexibility of 
selecting the proper part for his cache system. To select the 
proper RAM to function as a cache for the IDT79R3000, it is 
necessary to know the speed requirements for the RAM. This 
application note contains timing information pertaining to the 
IDT79R3000 operating at different frequencies and those of 
the static RAMs that function as cache. The timing for the 
cache RAMs (ordinary SRAMs) was derived from the 
IDT79R3000 requirements andby making certain assumptions. 
This document clearly details those assumptions and the 
methodology used to calculate the RAM AC timing parameters. 
This methodology can be used by the designer to deduce the 
RAM timing parameter requirements for the IDT79R3000 
operating at any frequency. 


more detailed description of the functionality and an 
architectural description of the IDT79R3000, please refer to 
the references listed at the end of this document. 

This application note starts with a brief description of the 
IDT79R3000, the four-phase clock inputs, cache timings, 
required timings for SRAMs to function as cache for the 
IDT79R3000, the factors used in the timing calculations, and 
the conclusion reached. 

FUNCTIONAL DESCRIPTION 

The IDT79R3000 is a 32-bit RISC microprocessor that is 
currently available from Integrated Device Technology, Inc. in 
two packages: the 144-pin PGA and the 172-pin ceramic 
flatpack. it has a 32-bit data bus, a 32-bit address bus that is 
divided into the low-order 16-bits (AdrLo) and the high-order 
1 6 bits (AdrHi), control signals forthecache, control signalsfor 
main memory, and power and ground pins. The IDT79R3000 
also has four double-frequency clock inputs and one clock 
output used for interfacing the IDT79R3000 to the external 
world. 


GENERAL DESCRIPTION 

The IDT79R3000 is a RISC microprocessor used in a 
variety of applications ranging from low-end embedded 
controllers to high-end workstations. Until recently, the 
IDT79R3000 operated at a frequency of up to 25MHz. The 
need for more performance necessitated that the I DT79R3000 
operate at higher frequencies. Therefore, this application 
note specifies the timing considerations for an IDT79R3000 
microprocessoroperating at 33MHz. This document does not 
explainthefunctionalityofthelDT79R3000 nor its architecture, 
but is limited to describing the key timing parameters. For a 


FOUR-PHASE CLOCKS AND DELAY-LINE 
SETTINGS 

Figure 1 shows a block level diagram of the IDT79R3000 
with its four clock inputs coming from a delay line. Table 1 
shows a summary of the delay line settings to be used for 
different operating frequencies of the IDT79R3000. Please 
note carefully that Clk2xSys is taken as the zero time 
reference and comes from the first tap of the delay line. 
The other 2x clocks lag Clk2xSys in time and follow it with 
respect to delay line taps. 
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Figure 1. Four-Phase Clock Input to the IDT79R3000 
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Parameter 

16MHz 

20MHz 

25MHz 

33MHz 

Clk2xSys 

0 

0 

0 

0 

Clk2xRd 

6 

6 

6 

4.5 

Clk2xSmp 

6 

6 

6 

4.5 

Clk2xPhi 

16 

14 

12 

9 
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Table 1. Delay Line Settings for IDT79R3000 Operating at Different Frequencies 


DERATING CALCULATIONS AND CACHE 
TIMING CONSIDERATIONS USING X 4 SRAMS 

The design of the cache subsystem for the IDT79R3000 is 
straightforward. Industry standard static RAMs function as 
cache. This section discusses the methodology used to 
calculate the critical timing parameters for a static RAM so that 
it can function as cache forthe I DT79R3000. This section also 
examines the timings for a 16MHz, 20MHz, 25MHz and 


33MHz IDT79R3000. The timing equations derived take into 
account the effect of capacitive loading on the bus. The 
derating factors are calculated based on certain assumptions. 
These assumptions are detailed in this section and the derating 
factors are calculated. The timing equations are then 
discussed. At the end of this section, a table is included 
containingtheSRAMtimingsfordifferentoperatingfrequencies 
of the IDT79R3000. 


Instruction 


Data 


Cache Cache 
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Figure 2. Block Level Diagram of a Cache Subsystem With the IDT79R3000 Using IDT7198 
16K x 4 to Function as Cache (the IDT79R3010 is not shown in this figure) 
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Device Capacitance 

Figure 2 shows a typical IDT79R3000-based system. The 
cache is comprised of fast 16 K x 4 static RAMs (i.e., the 
IDT7198). The AdrLo busof the IDT79R3000 goes through a 
latch: the I DT54/74FCT373. It also goes through a latch which 
is used to address the main memory. All the devices have an 
input and an output capacitance. In addition, each device is 
capable of driving a certain load. These parameters: the input 


capacitance, the output capacitance and the load capacitance, 
are given in Table 2. 

The cache format of the IDT79R3000 is comprised of 60 
bits — 32 bits of data, 4 bits of data parity, 20 bits of tag, a valid 
bit, and three bits of parity to cover the tag and the valid bit. 
With this requirement, it isclearthat.forthe instruction cache, 
fifteen IDT7198s (16K x 4 SRAMs) are needed. The same 
number is also required for the data cache. This means that 
there are a total of 30 SRAM devices for the cache. 


Device 

Number of Devices 

Capacitance 

Total Capacitance 

IDT79R300 

1 

CiN = 1 0pF 

lOpF 

IDT7198 

2 

Cout = 7pF 

14pF 

IDT374A 

1 

COUT = 12pF 

12pF 

IDT823B 

1 

CiN = lOpF 

lOpF 
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Table 2. Capacitances of the Various Devices in a Typical IDT79R300 Systems 


Assumptions for Surface Mount Layout Design with x4 
SRAMs 

In the following sections, certain assumptions have been 
made while calculating the derating factors. These are as 
follows: 

1) The trace has a capacitance of 2pF/inch. 

2) The speed of light is 2ns/foot in epoxy. 

3) The IDT79R3000 speeds are specified with a loading of 
25pF. For every additional 25pF, there is a delay of 1 ns. 


4) The distances between the IDT79R3000 and the latches 
are approximately linch each. 

5) The distances between the IDT79R3000 and the RAMs 
are approximately 4inches each. 

6) In all of the assumption, it is assumed that a surface mount 
package is used. Figure 3 shows a brief mechanical layout 
of an IDT79R3000 board. 

Figure 3 shows a brief mechanical layout of an IDT79R3000 
board. 
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Figure 3. Surface Mount Board of an IDT79R3000 System With Cache and Main Memory Interface 
and Approximate Distances Between the Various Devices 
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Address Bus Derating Calculations 

For the system shown in Figure 4.1, each address bit is 
connected to five latches: one going to the main memory 
interface buffer, two to the instruction cache and tag memory, 


and two to the data cache and tag memory, respectively. The 
latches in turn are connected to the address pins on the static 
RAM. Figure 4 shows all the devices to which each address 
bit is connected. 


Bit A2 
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Figure 4. Block Diagram Showing Various Devices Connected to One Address Bit 


Trace length from the CPU to the latch = 4inches — (1) 

Capacitance of the trace, Ctrace= 4 x 2pF/inch = 8pF — (2) 

Input capacitance of the 373 latch = lOpF — (3) 

As each address bit is connected to five latches (IDT54/74FCT373), 

Total input capacitance due to 5 devices, C373in = 5x10 = 50pF — (4) 

Total capacitive load = Ctrace + C373in = 8 + 50 = 58pF — (5) 

The rated IDT79R3000 load, CL(R3000) = 25pF — (6) 

From Eq. (5) and Eq. (6), 

Extra capacitive loading for the IDT79R3000 = 58 - 25 = 33pF — (7) 

Let us now examine the capacitive loading between the latches and the RAM. 

Path length from latches (IDT54/74FCT373s) to RAM (IDT7198s) = 3" — (8) 

Trace capacitance from latch to RAM = 3 x 2pF/in = 6pF _ (9) 

Input capacitance of the RAM = 5pF — (10) 

Each output from the latch is connected to eight RAM devices. 

Load due to 8 devices = 8x5 = 40pF — (11) 

Total capacitance = 40 + 6 = 46pF — (12) 

The rated 373 load = 50pF — (13) 

From Eq. (12) and Eq. (13) it can be seen that there is no RAM. However, there is a delay due to the capacitive load 
delay due to the capacitive load between the latch and the between the IDT79R3000 and the latch. This delay can be 

calculated as follows: 

For every extra 25pF of load, there is a delay of Ins. — (14) 

From Eq. (7) and Eq. (14), delay due to the capacitive load = 33 /25 = 1.32ns - (15) 

The speed of light = 2ns/foot — (16) 
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For a maximum path length of 5", delay = 5712" x 2 = 0.8ns — (17) 

From Eq. (15) and Eq. (17), 

Total propagation delay for the address bus, AdrLo d = 1 .32 + 0.8 = 2ns — (18) 

Data Bus Derating Calculations (IDT7198),thedatacache(IDT7198),areadregister(IDT54/ 

The derating calculations for the data path are similar to 74FCT374A) and a write register (IDT54/74FCT823). This is 
those done for the address path. The data bus is connected shown in Figure 5. Two cases must be considered: a data 
to the floating point unit (IDT79R3010), the instruction cache store and a data fetch. Both are discussed. 


Bit DO 
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Figure 5. Block Diagram Showing Various Devices Connected to One Address Bit 


Data Store (IDT79R3000 CPU Outputs Data): 

Each data bit is connected to two RAMs (!DT7198s) — one for instruction and one for data. 

The path length for the data bus = 5" — (18) 

Trace capacitance for the data bus = 5 x 2pF/in = lOpF — (19) 

Capacitive loading due to devices, 

Cdevices = 2 x CRAMin + C374in + C823 + CR3010 — (20) 

Cdevices = 2x7 + 12 + 10 + 10 = 46pF — (21) 

Total capacitive load = Ctrace + Cdevices = 46 + 10 = 56pF — (22) 

Propagation delay due to speed of light = 5712” x2 = 0.8ns — (23) 

Delay due to capacitive load = (56 -25 ) /25 = 1.24ns — (24) 

From Eq. (23) and Eq. (24), 

Total propagation delay on a store = 1 .24 + 0.8 = 2ns — (25) 

Load ( RAM Provides Data) 

Since the trace length is the same, Ctrace = 1 0pF _ (26) 

Capacitive load due to devices, 

Cdevices = CR3000 + CR3010 + CRAMin + C374in + C823 

Cdevices = 10 + 10 + 12 + 10 + 7 = 49pF — (27) 
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Total capacitance = Ctrace + Cdevices = 10 + 49 = 59pF 
The RAM rated drive is 30pF 

Extra load = Total capacitance - RAM rated drive = 59 - 30 = 29pF — (28) 
Propagation delay due to capacitive load = 29/25 = 1 .1 6ns — (29) 


Propagation delay due to the path length = 0.8ns — (30) 
Total propagation delay = 1.16 + 0.8 = 2ns — (31) 


Read and Write Control Derating Calculations 

The effect of the capacitance on the control signals from the 
I DT79R3000 processorto the caches and the memory interface 
is consi dered here. T he c ontro l signals on the IDT79R3000 
are the IRd, .DRd, IWr and DWr which control the instruction 
cache read, data cache read, instruction cache write and data 
cache write, respectively. The read and write control signals 


are connected to the output enable (OE), and write enable 
(WE) of the instruction and data cache, respectively. Two 
control signals each are provided for the read and write 
operations of each of the caches. Assuming the use of a 1 6 
K x 4 IDT71 98 static RAM, each control signal is connected to 
eight such static RAMs. 


Number of devices (SRAM) connected to each control line = 8 — (32) 
Input capacitance of each device (SRAM) = 5pF — (33) 

Total load capacitance = 5x8 = 40pF — (34) 

Path length = 5" — (35) 

Trace Capacitance = 5 x 2pF/in = lOpF = lOpF — (36) 

Total capacitance = 40 + 10 = 50pF — (37) 

Extra capacitive load = 50 - 25 = 25pF — (38) 

Propagation delay due to capacitive load = Ins — (39) 

Propagation delay due to the trace length = 0.8ns — (40) 


Total propagation delay = 1 + 0.8 = 2ns — (41) 


Assumptions for Through-hole Layout Design Using x4 
SRAMS 

In this section, the deratings are calculated for a through- 
hole design. Figure 6 shows an example of the layout of a 


through-hole design. This layout corresponds to an 
IDT79R3000 demonstration board used extensively at IDT. 
The data trace lengths are 10 inches and the address trace 
lengths are 9 inches. 
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Figure 6. Board Layout for a Through Hole Design of an IDT79R3000 Cache Subsystem 
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Address Derating Calculations 

For the system shown in Figure 4, the number of devices connected to the IDT79R3000 is the same. 
Trace length from the CPU to the latch = 9" — (42) 

Trace capacitance = Ctrace = 9x2=1 8pF — (43) 

Input capacitance of the latches = 1 0pF — (44) 

Total capacitance = 5 x C373 + Ctrace = 5x10 + 18 = 68pF — (45) 

Extra load on the R3000 = Cl = 68 - 25 = 43pF — (46) 

Since the rated IDT54/74FCT373 load is 50pF, there is no derating factor between the IDT54/74FCT373 and 
the RAMs. 


Therefore the derating is between the IDT79R3000 and the latches. 


Delay due to capacitance = 43/25 = 1 ,75ns — (47) 

Propagation delay due to the trace length = 9/12x2= 1.5ns — (48) 


Total derating on the address bus = 1 .75 + 1 .5 = 3ns — (49) 


Derating on the Data Bus 

As in the section entitled Data Bus Derating Calculations, 
the derating for the data bus is calculated for two cases: i) an 
instruction fetch and ii) data store. 


Data Store (IDT79R3000 CPU Outputs Data) 

Each data bit is connected to two RAMs (IDT71 98s) — one 
for instruction and one for data. 
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The path length for the data bus = 10" — (50) 

Trace capacitance for the data bus = 10 x 2pF/in = 20pF — (51) 

Capacitive loading due to devices, 

. Cdevices = 2 X CRAMin + C374in + C823 + CR3010 — (52) 

Cdevices = 2x7 + 12 + 10 + 10 = 46pF — (53) 

Total capacitive load = Ctrace + Cdevices = 20 + 46 = 66pF — (54) 

Propagation delay due to speed of light = 10712" x 2 = 1 ,6ns — (55) 

Delay due to capacitive load = (66 - 25 ) /25 = 1 .55ns — (56) 

From Eq. (23) and Eq. (24), 

Total propagation delay on a store = 1 .54 + 1 .67 « 3ns — (57) 

Load ( RAM Provides Data) 

Since the trace length is the same, Ctrace = 20pF — (58) 

Capacitive load due to devices, 

Cdevices = CR3000 + CR3010 + CRAMin + C374in + C823 
Cdevices = 10 + 10 + 12 + 10 + 7 = 49pF — (59) 

Total capacitance = Ctrace + Cdevices = 20 + 49 = 69pF 
The RAM rated drive is 30pF 

Extra load = Total capacitance - RAM rated drive = 69 - 30 = 39pF — (60) 

Propagation delay due to capacitive load = 39/30 = 1 .3ns — (61) 

Propagation delay due to the path length = 1 ,6ns — (62) 

Total propagation delay = 1 .3 + 1 .6 = 3ns — (63) 

Read and Write Control Deratings effect. The trace length from the CPU to the RAMs is 9 inches 

For a through-hole design, the effect of derating on the for the layout shown in Figure 6. Each control signal is 
control signals will be more. This section calculates that connected to 8 devices. 

Number of RAM devices connected to each control signal = 8 — (64) 

Input capacitance of each RAM = 5pF — (65) 

Total load capacitance = 8x5 = 40pF — (66) 

The trace length = 9' — (67) 

Trace capacitance = 9 x 2pF/inch = 18pF — (68) 

Total load capacitance = 40 + 18 = 58pF — (69) 

Extra load = 58 - 25 = 33pF — (70) 
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Derating due to capacitive load = 33 / 25 = 1 ,35ns — (71 ) 

Propagation delay due to trace length = 9 / 2 x 2ns/foot = 1 ,5ns — (72) 


Total derating = 1 .35 + 1.5 = 3ns — (73) 

Timing Equations for Cache Design 

This section deals with the timing equations that enable us 
to determine the critical timing requirements of the static RAM 
that will be used as cache. These equations are based on the 
use of static RAMs (without built-in latches) as cache RAMs. 
The superscript ‘d’ in the following equations denotes the 
deratingsto betaken into account. The static RAM chosen for 
illustration here is a 1 6K x 4 IDT71 98. The board Is assumed 
to be surface mount for all speeds of the IDT79R3000 
except for the 16MHz speed grade. The deratings for the 
surface mount board are 2ns and 3ns for a through-hole board 
(which is used for the 16MHz IDT79R3000). The deratings 
were derived from certain assumptions. The explanation and 
the methodology used is set forth in the previous sections. 
Following is a generalized equation and the timing requirements 


for different frequencies of the IDT79R3000. All calculations 
are based on the IDT79R3000 specifications for the four 
speed versions ( 1 6, 20, 25 and 33MHz) .which are found in the 
IDT data sheets. 

Figures 7, 8, 9, and 10 show the timing diagrams of the 
IDT79R3000 when it is doing a data store followed by an 
instruction fetch. This is the worst case example and is 
chosen to determine the SRAM parameter requirements. 
Figure 7 shows the timing diagrams for an IDT79R3000 
operating at 16MHz. Figures 8, 9 and 10 show the timing 
diagrams for an IDT79R3000 operating at 20MHz, 25MHz 
and 33MHz, respectively. The encircled numbers represent 
the equations presented in the section entitled Timing 
Equations for Cache Design. The timing diagram, in 
conjunction with the equations, are usedto arrive atdetermining 
the timing requirements. 
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The following equations are used to determine the timing 
parameters for the static RAM so that they can function as 
cache for different operating frequencies of the IDT79R3000. 
The numbers at the left correspond to the encircled numbers 


in the timing diagrams. Equations 9 and 1 0 are not shown in 
the timing diagram but are included for completeness. The 
equations also use some IDT79R3000 parameters. These 
are listed in Table 3. 


(1 ) Internal Sample to Phase Delay 

This is the time that the processor needs to sample the incoming data. T ypically, for the IDT79R3000, 
tsmp > 5 

(2) RAM Address Access Time 

This equation is used to determine the Address Access time parameter requirements of the static 
RAM. From the timing diagram of Figure 10, it is easily calculated. As an example, let us calculate the 
address access time for a 33MHz IDT79R3000. The total cycle time for a 33MHz IDT79R3000 is 30ns. 
If the processor's sample time requirement is met, the time remaining in the cycle is 24ns. In this time 
the data has to be presented to the processor. The processor requires a data set-up time of 4ns. There 
is also a propagation delay through the latch for the address bus. For the 33MHz part, a fast IDT54/ 
74FCT373C is used which has a maximum propagation delay of 4.2ns (see Table 4). The derating 
factors due to the capacitance and the trace length also have to be taken into account. Using all these 
factors, the equation is: 

tRAMAA < tcyc - tsmp - tDS - t373PD - tAdrLo d - tRAMAA d 

16MHz IDT79R3000: tRAMAA <60-10-9-5.2-3-3 
tRAMAA < 29.8 

20 MHz IDT79R3000: tRAMAA <50-8-8-5.2-2-2 
tRAMAA < 24.8 

25 MHz IDT79R3000: tRAMAA <40-6-6-5.2-2-2 
tRAMAA < 18.8 

33 MHz IDT79R3000: tRAMAA < 30 - 4.5 - 4.5 - 4.7 - 2 - 2 
tRAMAA < 12.3 

(3) Cache Enable to Sample 

This equation is used to determine the system output enable(toES) requirements of the cache RAM 
and should meet the processor’s set-up specification. The output enable time (tOE) specifications forthe 
RAM are tested fora voltage change of 200mV(afallfrom 1.732V to 1. 532Vfor IDT RAMs). Fora system, 
however, the voltage falls from approximately 3.3V to 1.5V. This fall time is usually a nanosecond. 

Therefore, the RAM specifications should take this system factor into consideration and specify 
the output enable time at least one nanosecond lower than the calculated timings. 

tOES < tcyc/2 - tRD* - tDS - tsys-smp + tsys-rd - tOES? 

16MHz IDT79R3000: tOES <30-3-9-10 + 10-3 
tOES < 1 5 

20MHz IDT79R3000: tOES <25-2-8-8 + 8- 2 
tOES < 13 

25MHz IDT79R3000: tOES <20-2-6-6 + 6- 2 
tOES < 1 0 

33MHz IDT79R3000: t OES < 15 - 2 - 4 - 4.5 + 4.5 - 2 
tOES < 7 
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(4) Minimum Read Pulse Width 

This timing requirement guarantees that the read pulse width generated by the processor is at least 
as long as the cache RAM output-enable time. 

tOES < tcyc/2 - tsys-rd - tOES? 

16MHz IDT79R3000: tOES <30-10-3 
tOES < 1 7 

20MHz IDT79R3000: tOES <25-8-2 
tOES < 15 

25MHz IDT79R3000: tOES <20-6-2 
tOES < 1 2 

33MHz IDT79R3000: tOES <15-4.5-2 
tOES < 8.5 

(5) Read-Write 1-Cache Data Bus Contention 

This timing requirement ensures that the RAM output is tristated soon enough atter the instruction 
read signal goes high. In the worst case, when the processor performs a store operation, no data 
contention occurs. 
tRAMHZ < tsys - tRc? + DEn 

16MHz IDT79R3000: tRAMHZ < 16 - 3 + (-2.5) 
tRAMHZ < 10.5 

20MHz IDT79R3000: tRAMHZ < 14 - 2 + (-2) 
tRAMHZ < 10 

25MHz IDT79R3000: tRAMHZ < 12 - 2 + (-1 .5) 
tRAMHZ < 8.5 

33MHz IDT79R3000: tRAMHZ < 9 - 2 + (-1) 
tRAMHZ < 6 

(6) Processor Data Set-up to End of Write 

This enables the designer to determine whether the cache RAMs have adequate data set-up time 
when the processor does a store operation. In the equation, the minimum derating is used on the write 
line i.e., twr d , because that is the worst case assumption. 

tRAMDS < tcyc/2 - tsys-smp - tDVal - tDVaf 1 - tWr d 

16MHz IDT79R3000: tRAMDS < 30 - 10 - 3 - 3 - (-2) 
tRAMDS < 16 

20MHz IDT79R3000: tRAMDS < 25 - 8 - 3 - 2 - (-1) 
tRAMDS < 13 

25MHz IDT79R3000: tRAMDS < 20 - 6 - 2 - 2 - (-1 ) 
tRAMDS < 1 1 

33MHz IDT79R3000: tRAMDS < 15 - 4.5 - 2 - 2 - (-1) 
tRAMDS < 7.5 
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(7) Data Hold from End of Write 

This parameter requirement guarantees that the data hold from end of write of the cache RAM is met 
when the processor or the read buffer is writing to the RAMs. 
tRAMHD < tsmp-rd + tRAMLZ 

16MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

20MHz 1DT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

25MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

33MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

(8) Data SetUp to SysClk 

This timing parameter ensures that the set-up time into an external register (for the main memory 
interface) is sufficient enough for when the processor is doing a store. The data is clocked in the register 
on the rising edge of the buffered SysOut (through an inverting IDT54/74FCT240A). In this equation, 
tsys(min) d is used to ensure worst case calculations. 

tsetupSys < tcyc/2 - tsys - tDVal - tDVafi + tsys d + t240PDmin 

16MHz IDT79R3000: tSeiUpSys < 30-16-3-3 + 2 + 1.5 
tSetUpSys < 11.5 

20MHz IDT79R3000: tSetUpSys < 25 - 14 - 3 - 2 + 1 + 1 .5 
tSetUpSys < 8.5 

25MHz IDT79R3000: tSeiUpSys < 20 - 12 - 2 - 2 + 1 + 1 .5 
tSetUpSys < 6.5 

33MHz IDT79R3000: tSeiUpSys <15-9-2-2 + 1+1.5 
tSetUpSys < 4.5 

(9) Data Hold from SysClk 

This timing parameter is to guarantee that the hold time specification for an external register is met 
on a processor store. In this equation the minimum value of tRD d is taken to ensure worst case numbers. 

tHoldSys < tsys-rd - tsys d - t240PDmax + tRAMLZ + tRc? 

16MHz IDT79R3000: tHoldSys < 6 - 2 - 4.8 + 2 + 1 
tHoldSys < 2.2 

20MHz IDT79R3000: tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys < 3.2 

25MHz IDT79R3000: tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys < 3.2 

33MHz IDT79R3000: tHoldSys <4.5-1 - 4.8 + 2 + 1 
tHoldSys < 1 .9 
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(10) Address Set-up to End of Write: 

This equation enables us to determine the timing requirement for the RAM so that the address set- 
up time is sufficient before the trailing edge of the write pulse. 

tRAMAW < tcyc - tsmp-sys - tAdrLa i - t373PD + tWi d 

16MHz IDT79R3000: tRAMAW < 60-10-3-5.2 + 3 
tRAMAW < 44.8 

20MHz 1DT79R3000: tRAMAW <50-8-2-5.2 + 2 
tRAMAW < 36.8 

25MHz IDT79R3000: tRAMAW <40-6-2-5.2 + 2 
tRAMAW < 28.8 

33MHz IDT79R3000: tRAMAW <30-4.5-2-4.7 + 2 
tRAMAW < 20.8 

(11) Write Hold Pulse-Width: 

This requirement guarantees that the cache RAM's minimum write pulse width specification is met. 
tRAMPW< tcyc/2 - tWrDly 

16MHz IDT79R3000: tRAMPW <30-5 
tRAMPW < 25 

20MHz IDT79R3000: tRAMPW <25-4 
tRAMPW <21 

25MHz IDT79R3000: tRAMPW <20-3 
tRAMPW < 17 

33MHz IDT79R3000: tRAMPW <15-2 
tRAMPW < 13 

(12) Write Recovery Time: 

The write recovery time is the time between the write pulse going inactive and the change in address. 
This characteristic is usually specified by the SRAM manufacturer and is typically zero. This parameter 
is important in the IDT79R3000 cache interface and care must be taken to choose the proper part to 
prevent race conditions. In the IDT79R3000 cache design using the IDT7198 16 Kx4 RA M, the latch 
enable is controlled by ICIk/DCIk and the write enable on the RAM is controlled by I Wr/DWr. The timing 
diagram shows the relationship between the two clocks and the parameter tWR Timing calculations 
below show that the write recovery specifications are not violated. 

Derating Calculations for DCIk and DWr 2) The trace length for the DWr signal is 6 inches. 

To cal culate the effect of derating on the control signals 3) The trace length for the DCIk signal is 2 inches. 

DCIk and DWr, the following assumptions have been made: 4) The trace length of the address bus to the RAM is 4 

1) The pin to pin variation on an IDT79R3000 device is 15 inches. 

% for a 50pF load. Under the maximum cas e, the 5) Each ICIk control signal is connected to four IDT54/ 
deratings will vary from 1.7 to 2ns for DCIk and DWr. 74FCT373 devices. 

Under the minimum case the deratings will vary from 
0.58 to 0.625ns. 
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2853 drw 11 

Figure 11. Circuit Showing IWr and ICIk Signals to Latch and SRAM 


The input and ouput capacitances for the IDT79R3000, 
IDT71 98 and IDT54/74FCT373 can be obtained from Table 2. 
Figure 1 1 is a simple circuit showing the connections of ICIk 
and IWr from the IDT79R3000to the latch enable (LE) on the 


IDT54/74FCT373 device and Write Enable (WE) on the static 
RAM, respectively. Figure 12 shows the tWR timings with 
respect to the data cache in an IDT79R3000-based system. 
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Figure 12. Write Recovery Timing 


To prove that the tWR parameter is not violated, the 
calculations are done as shown below. The derating ef fects 
on the DCIk and AdrLo signal should exceed that of the DWr 
signal. These calculations are similar to the derating 
calculations in previous sections. The minimum propagation 


delay through the latch is considered. The derating on the 
DC Ik sig nal coming out of the IDT79R3000 is less than that of 
the DWr signal . The reverse case is superfluous and, in fact, 
improves the situation. The minimum and worst case derating 
effects are shown below. The write recovery time parameter 
must not be violated over the entire operating range. 


Capacitive Derations: (IDT79R3000 Variations 15% 1.7ns - 2ns ) 


(( Cdriver + Cload + Ctrace) - Crated)/25 * (MinOrMax) = CLD 
Iclk ((10 + 4* 10 + 2*2) -25)/25* 1.7 = 1.97ns 

Iwr ((10 + 8*7 + 6*2)- 25)125 * 2 = 4.24ns 

RamAddr ((12 + 8*7 + 4*2)- 50)/25 * 0.5 = 0.52ns 
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Calculations : 

Race path 1 : Iclkmin + tPD(ie) + RamAdrmin = ( 1 .97 + 2 + 0.52 ) = 4.49 
Race path 2 : Iwrmax = 4.24 

Pathl - Path2 > twR 


4.49 - 4.24 > 0 


Capacitive Derations: (IDT79R3000 Variations 15% 0.58ns - 0.625ns ) 


(( Cdriver + Cload + Ctrace) - Crated)/25 * (MinOrMax) = CLD 
Iclk ((10 + 4*10 + 2*2)- 25)/25 * 0.58 = 0.58ns 

Iwr ((10 + 8*7 + 6* 2)- 25)/25 * 0.625 = 1 ,325ns 

RamAddr ((12 + 8* 7 + 4*2)- 50)/25 * 0.5 = 0.52ns 
Calculations : 

Race path 1 : Iclkmin + tPD(ie) + RamAdrmin = ( 0.58 + 2 + 0.52 ) = 3.1 ns 
Race path 2 : Iwrmax = 1 .325 

Pathl - Path2 > twR 

3.1 - 1.325 = 1.775 >0 


From the above calculations and the RAM timing Tables 4 
and 5, it can be seen that the data set-up to the processor is 
met. The output enable of the RAM, which is controlled by IRd, 
goes high and the RAM output starts to go tri-state. From the 
figure, the reader may correctly question whether the hold 
time requirements of the IDT79R3000 are met. They are 
indeed met by the capacitance on the bus and also because 
CMOS devices are being used. The technical note entitled 
Meeting Bus Hold for the IDT79R3000 gives a more detailed 
explanation. 


Table 4 gives the timing data sheet for a typical SRAM 
device. The timing parameters correspond to a particular 
RAM configuration. Other RAM devices may have different 
timings for some of the parameters; however, there are certain 
timings that must be met. These critical parameters are listed 
in Table 5 and the unlisted parameters may vary a bit from 
device to device. 
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AC ELECTRICAL CHARACTERISTICS — COMMERCIAL TEMPERATURE RANGE 


Min. Max. Min. Max. Min. Max. Min. Max. 


12.5 


12.5 


30 500 


Symbol 

Parameter 

Clock 

TCkHigh 

Input Clock High 

TCkLow 

Input Clock Low 

TCkP 

Input Clock Period 


Clk2xSys to Clk2xSmp 

Clk2xSmp to Clk2xRd 

Clk2xSmp to Clk2xPhi 


TDen 

Data Enable 

TDDis 

Data Disable 

TDVal 

Data Valid 

TWrDly 

Write Delay 

TDS 

Data Set-Up 

TDH 

Data Hold 

TCBS 

CpBusy Set-Up 

TCBH 

CpBusy Hold 

TAcTy 

Access Type [1 :0] 

TAT2 

Access Type [2] 

TMWr 

Memory Write 

TExe 

Exception 


Stall Operation 


Reset Initialization 



TSAVal 

Address Valid 

TSAcTy 

Access Type Valid 

TMRdi 

Memory Read Initiate 

TMRdT 

Memory Read Terminate 

TSd 

Run Terminate 

TRun 

Run Initiate 

TSMWr 

Memory Write 

TSEx 

Exception Valid 



1 | 27 


20 


TRST 

Reset Pulse Width 

6 

TrstPLL 

Reset Timing, PLL (1) On 

3000 

Trstcp 

Reset Timing, PLL Off 

128 


Capacitive Load Derating Factor 


Load Derate 


NOTE: 

1 . PLL: Phase Locked Loops. 


15 

ns 

10 

ns 

10 

ns 

3.5 

ns 

8 

ns 

3 

ns 

9.5 

ns 

10 

ns 



1 

mm 

i 

mm 



Table 3. IDT79R300 AC Specifications 
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READ CYCLE TIMING SPECIFICATIONS 



16.7MHZ 

20.0MHz 

25.0MHz 

33.0MHz 

Parameter 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

tRC 

30 

— 

25 

— 

20 

— 

12 

— 

tAA 

— 

30 

— 

25 

— 

19 

' — 


tACSI 

— 

30 

— 

25 

— 

19 

— 

15 

tCLZI 

5 

— 

5 

— 

5 

— 

2 

— 

tOE 

— 

15 

— 

13 

— 

10 

— 

7 

tOLZ 

5 

— 

5 

— 

5 

— 

3 

— 

tCHZI 

— 

12 

— 

10 

— 

8 

— 

8 

tOHZ 

— 

10 

— 

10 

— 

8 

— 

6 

tOH 

5 

— 

5 

— 

5 

— 

0 

— 

tPU 

0 

— 

0 

— 

0 

— 

0 

— 

tPD 

— 

30 

— 

25 

— 

20 

— 

15 


2653 tbl 04 


WRITE CYCLE TIMING SPECIFICATIONS 


Parameter 

16.7MHZ 

20.0MHz 

25.0MHz 

33.0MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

twc 

30 

— 

25 

— 

20 

— 

15 

— 

tcwi 

25 

— 

21 

— 

17 

— 

15 

— 

tAW 

25 

— 

21 

— 

17 

— 

15 

— 

tAS 

0 

— 

0 

— 

0 

— 

0 

— 

tWP 

25 

— 

21 

■ — 

17 

— 

13 

— 

tWFtl 

0 

— 

0 

— 

0 

— 

0 

— 

tWR2 

0 

— 

0 

— 

0 

— 

0 

— 

tWHZ 

— 

18 

— 

16 

— 

8 

— 

6 

tDW 

16 

— 

13 

— 

11 

— 

7 

— 

tDH 

0 

— 

0 

— 

0 

— 

0 

— 

tow 

5 

— 

5 

— 

5 

— 

5 

— 


2853 tbl 05 

NOTE: 

1 . This assumes that an IDT54/74FCT373C with a tPD = 4.2ns is used. 


Table 4. Static RAM Read and Write Timings to Work as Cache With the IDT79R300 
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READ CYCLE TIMING SPECIFICATIONS* 1 * 


Parameter 

16.7MHZ 

20.0MHz 

25.0MHz 

33.0MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

tRC 

* 

— 

* 

— 

* 

— 

* 

— 

tAA 

— 

29.8 

— 

24.8 

— 

18.8 

— 

12.8 

tACSI 

— 

* 

— 

* 

— 

* 

— 

* 

tCLZI 

* 

— 

* 

— 

* 

— 

* 

— 

tOE 

— 

15 

— 

13 

— 

10 

— 

7 

tOLZ 

* 

— 

* 

— 

* 

— 

* 

— 

tCHZI 

— 

* 

— 

* 

■ — 

* 

— 

* 

tOHZ 

— 

10.5 

— 

10 

— 

8.5 

— 

6 

tOH 

* 

— 

* 

— 

* 

— 

* 

— 

tPU 

* 

— 

* 

— 

* 

— 

* 

— 

tPD 

— 

* 

— 

* 

— 

* 

— 

* 
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WRITE CYCLE TIMING SPECIFICATIONS' 1 * 


Parameter 

16.7MHZ 

20.0MHz 

25.0MHz 

33.0MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

twc 

* 

— 

* 

— 

* 

_ 

* 

— 

tcwi 

* 

— 

* 

— 

* 

— 

* 

— 

tAW 

44.8 

— 

36.8 

— 

28.8 

— 

19.8 

— 

tAS 

* 

— 

* 

— 

• 

— 

* 

— 

tWP 

25 

— 

21 

— 

17 

— 

13 

— 

tWRI 

* 

— 

* 

— 

* 

— 

‘ 

— 

tWR2 

* 

— 

* 

— 

* 

— 

* 

— 

tWHZ 

— 

* 

— 

* 

— 

* 

— 

* 

tDW 

16 

— 

13 

— 

11 

— 

7 

— 

tDH 

* 

— 

* 

— 

* 

— 

* 

— 

tow 

* 

— 

♦ 

— 

* 

— 

* 

— 
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NOTE: 

1. Shown are the minimum or maximum parameters. Numbers not shown are not critical for the IDT79R3000 application. 

Table 5. Static RAM Parameters to Work as Cache with the IDT79R3000 
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Parameter 

Load 

Symbol 

Min. 

Max. 

IDT54/74FCT373A Propagation Delay 

50 

T373PD 

— 

5.2 

IDT54/74FCT373A Latch Enable Delay 

50 

T373LE 

2 

8.5 

IDT54/74FCT373A Latch Enable Hold 

50 

T373Hld 

1.8 

— 

IDT54/74FCT240A Propagation Delay 

50 

T240PD 

1.5 

4.8 

IDT54/74FCT373C Propagation Delay 

50 

T373PD 

1.5 

4.7 

IDT54/74FCT240C Propagation Delay 

50 

T240PD 

1.5 

CO 


2853 tbl 08 


Table 6. Timing Parameters of IDT54/74FCT Logic Devices 


Legend: 

tRAMAA - RAM Access Time 

tRAMOE - RAM Output Enable Time 

tRAMHZ - RAM OutPut Low Impedance to Output in High Impedance 

tRAMLZ - RAM Output in High Impedance to Output in Low Impedance 

tRAMHD - RAM Data Hold Time 


tDS - IDT79R3000 Data Set-up Time 

tsys - Phase Difference between Clk2xSys and Clk2xPhi 

trd - Phase Difference between Clk2xPhi and Clk2xRd 

tsmp - Phase Difference between Clk2xPhi and Clk2xSmp 

tcyc - Cycle time of the IDT79R3000 

tsmp-rd = tsmp - trd 


t240PD - Propagation delay from Clk to Output of IDT54/74FCT240A 
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USING XI 6 LATCHED RAMS AS CACHE FOR 
THE IDT79R3000 ON A SURFACE MOUNT 
DESIGN 

Assumptions for Surface Mount Design Layout Using 
x16 Latched RAMs as Cache for the IDT79R3000 

In this section the RAM timings are calculated for a 4K x 1 6 
IDT71586 which has the latches built in. For the static RAMs 
with latches built in, the address access times, tRAMAA, and 
the address set-up to end of write, tRAMAW will change from 
those of a regular static RAM. The propagation delay due to 
the latches is eliminated, increasing the access time and the 
address set-up to end of write by about 5ns. In addition, the 
board layout is different because the distances from the CPU 
to the RAM are reduced. This decreases the derating factors 
by afinite amount. This section calculates the derating factors 
for an IDT79R3000 cache design using the IDT71586 as 
cache. These are the following assumptions: 

1) The trace has a capacitance of 2pF/inch. 

2) The speed of light is 2ns/foot in epoxy. 


3) The rated load of the IDT79R3000 at which the timings 
are specified is 25pF. There is a 1 ns derating for every 
additional 25pF. 

4) The distances betweenthelDT79R3000andthe latches 
are approximately 5 inches. 

5) The distances between the IDT79R3000 and the RAMs 
are approximately 2 inches each. 

6) It is assumed that surface mount packages are used. 
The input capacitance of the RAMs is a typical value 
(7pF) for a PLCC package. 

Derating Calculations Using IDT71586 as Cache RAMs 

The derating factors for the IDT71586 cache RAMs follow 
the same methodology as explained in the section entitled 
Dearting Calculations and Cache Timing Considerations 
Using x4 SRAMs. The cache size is 4K words for instruction 
and 4K words for data. The latches are eliminated. The 
derating factors for the address and data bus are calculated. 

Figure 13 shows a cache system for the IDT79R3000 with 
the latched RAMs (i.e., IDT71586 as the cache). There are a 
total of 8 such devices required for a 4K word size of instruction 
and data cache. 



For each cache, i.e., Instruction or Data Cache — Size is 4KB 
Number of IDT71586S for Data and Tag = 8 

2853 drw 13 

Figure 13. IDT71586 Used as Cache RAMs for the IDT79R3000, Cache Size = 4KWords 
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Figure 14. Surface Mount Board Layout of an IDT79R3000 System Using IDT71586 as Cache 
and Approximate Distances Between Devices 


Figure 14 shows an example layout of an IDT79R3000 Address Bus Derating Calculations 
surface mount design board using latched RAMs (IDT71586) Each AdrLo bus is connected to eight latched RAMs (i.e., 
as cache for the IDT79R3000 system. The distance between the IDT71586) and the address latch for main memory writes 
the IDT79R3000 data pins and the caches is about 2 inches, and reads. (Figure 15). 

The total trace length for the address bus and the data bus is 
about 4 inches each. 



Figure 15. Number of Devices Connected to Address Bus 


2853 drw 15 


Trace length from the CPU to the address latch (for main memory) = 4 inches — (74) 
Capacitance of the trace = Ctrace = 4 x 2pF/inch = 8pF — (75) 

Input capacitance of the 373 latch = lOpF — (76) 

Total input capacitance due to 8 devices = 8x7 = 56pF — (77) 

Total capacitance due to the load = 8 + 56 + 10 = 74pF — (78) 

The rated IDT79R3000 load = 25pF — (79) 

Extra loading on the IDT79R3000 = 74 - 25 = 49pF — (80) 

The delay can be calculated as follows: 

For every extra 25pF of load, there is a delay of 1 ns — (81 ) 

From Eq. 80 and Eq. 81 , delay due to the capacitive load = 49/25 = 2ns — (82) 
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The speed of light = 2ns/foot — (83) 


For a maximum path length of 3", delay = 3712" x 2 = 0.5ns — (84) 


From Eq. 82 and Eq. 84, 

Total propagation delay for the address bus = 2 + 0.5 = 2.5ns — (85) 


From the above calculations, it is seen that the derating on the address bus is 2.5ns. 


Data Bus Derating Calculations 

From Figure 1 6, it is seen that the data bus is connected to 
the floating point unit (IDT79R3010), two IDT71586 devices, 
one read register (IDT54/74FCT374A) and one write register 


(IDT54/74FCT823B). As in the previous section where we 
considered a 16K x 4 static RAM, we have to calculate the 
deratings for two cases: i) for an instruction fetch, and ii) for 
a data store. 


Bit DO 
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Figure 16. Devices Data Bus Connected to the IDT79R3000 


Data Store (IDT79R3000 Outputs Data) 

Each data bit is connected to two RAM devices — one for instruction and one for data. 

The path length of the data bus = 4 inches — (86) 

Trace capacitance of the data bus = 4 x 2pF/inch = 8pF — (87) 

Capacitive loading on the data bus due to the different devices = 2 x CRAMin + CR30ioin + C374out+ C823in = 
2x7 + 10 + 12 + 10 = 46pF — (88) 

Total capacitive load = Cdevices + Ctrace = 46 + 8 = 54pF — (89) 

Propagation delay due to speed of light = 4712" x 2 = 0.6ns — (90) 

Delay due to capacitive load = (54 - 25) /25 = 1.16ns — (91) 

Total delay = 1.16 + 0.7 = 1.8ns = 2ns— (92) 

Load Data Into IDT79R3000 (RAM Outputs Data) 

Since the trace length is the same, the trace capacitance Ctrace = 8pF — (93) 

Capacitive load = CR3000in + CR3010in + C 7 1586in + C374in + C823out — (94) 

Cdevices = 10 + 10 + 7+12 + 10 = 49pF — (95) 
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C total = 49 + 8 = 57pF — (96) 


The RAM rated drive = 30pF — (97) 


Propagation delay due to extra capacitive loading = (57 - 30) / 25 = 1 ,08ns — (98) 
Propagation delay due to path length = 0.8ns — (99) 


Total propagation delay = 1 .08 + 0.8 = 2ns — (1 00) 


Read and Write Control Derating Calculations 

The effect of the capacitance on the control signals 
from the I DT79R3000 processorto the caches and the memory 
interface is considered here. The c ontro l signals on the 
IDT79R3000arethelRd, DRd, IWrand DWr which control the 
instruction cache read, data cache read, instruction cache 
write and data cache write, respectively. The read and write 


control signals are connected to the output enable (OE) and 
write enable (WE) of the instruction and data cache, 
respectively. Two control signals each are provided for the 
read and write operations of each of the caches. Assuming 
the use of a 4K x 1 6 IDT71 586 static RAM, each control signal 
is connected to 2 such static RAMs. 


Number of devices (SRAM) connected to each control line = 2 — (101) 
Input capacitance of each device (SRAM) = 5pF — (102) 

Total load capacitance = 2x5 = lOpF — (103) 

Path length = 4" — (104) 

Trace Capacitance = 4 x 2pF/in = 8pF = 8pF — (1 05) 

Total capacitance = 10 + 8 = 18pF — (106) 


There is no extra capacitive loading here as the rated IDT79R3000 load is 25pF 
Propagation delay due to the trace length = 0.8ns — (107) 


Total propagation delay = 0.8 = Ins — (108) 

Timing Equations for Cache Design 

This section contains the timing equations that enable us to 
determine the critical timing requirements of the static RAM 
that will be used as cache. These equations are based on the 
use of static RAMs without built-in latches as cache RAMs. 
The superscript ‘d’ in the following equations denotes the 
deratings to be taken into account. The static RAM chosen for 
illustration is a 4K x 16 IDT71586. The board is assumed to 
be surface mount for all speeds of the IDT79R3000. The 


deratings for the surface mount board are 2ns. The deratings 
were derived from certain assumptions. The explanation and 
the methodology used is explained in the previous sections. 
Following is a generalized equation given by the timing 
requirementsfordifferentfrequenciesofthe IDT79R3000. All 
calculations are based on the IDT79R3000 specifications for 
the four speed versions (16, 20, 25 and 33MHz), which are 
found in the IDT data sheets. 


( 1 ) 


Internal Sample to Phase Delay 

This is the time that the processor needs to sample the incoming data. Typically, for the IDT79R3000, 
tsmp > 5. 



9.8 


27 



IDT79R3000 

33MHz SPECIFICATION AND CACHE TIMING 


APPLICATION NOTE AN-61 


(2) RAM Address Access Time 

This equation is used to determine the Address Access time parameter requirements of the static 
RAM. From the timing diagram of Figure 1 6, it is easily calculated. The total cycle time for a 33MHz 
IDT79R3000 is 30ns. If the processor’s sample time requirement is met, the time remaining in the cycle 
is 24ns in which the data has to be presented to the processor. The processor requires a data set-up 
time of 4ns. The derating factors due to the capacitance and the trace length also have to be taken into 
account. Using all these factors, the equation is, 
tRAMAA < tcyc - tsmp - tDS - tAdrLo d - tRAMAA d 

16MHz IDT79R3000: tRAMAA <60-10-9 - 3.5 - 3 
tRAMAA < 34.5 

20MHz IDT79R3000: tRAMAA <50-8-8-2.5-2 
tRAMAA < 29.5 

25MHz IDT79R3000: tRAMAA <40-6-6-2.5-2 
tRAMAA < 23.5 

33MHz IDT79R3000: tRAMAA < 30 - 4.5 - 4.5 - 2.5 - 2 
tRAMAA < 16.5 

(3) Cache Enable to Sample 

This equation is used to determine the output enable requirements of the cache RAM and should meet 
the processor’s set-up specification. The output enable time for the latched RAM is specified by the 
manufacturer and tested for a voltage change of 200mV (1 ,732V to 1 ,532V for IDT RAMs). For a system, 
the voltage falls from a level of 3.3V to 1 .732 and the added fall time must be considered when specifying 
the RAM toE parameter. This fall time is approximately one additional nanosecond. Therefore, the RAM 
tOE parameter should be one nanosecond lower than the calculated numbers below. 
tOES < tcyc/2 - tRD d - tDS - tsys-smp + tsys-rd - tOES d 

1 6MHz IDT79R3000: tOES <30-2-9-10 + 10-3 
tOES < 1 6 

20MHz IDT79R3000: tOES <25-1-8-8 + 8- 2 
tOES < 14 

25MHz IDT79R3000: tOES <20-1-6-6 + 6- 2 
tOES < 1 1 

33MHz IDT79R3000: tOES <15-1-4-4.5 + 4.5-2 
tOES <8 

(4) Minimum Read Pulse Width 

This timing requirement guarantees that the read pulse width generated by the processor is at least 
as long as the cache RAM output enable time. 
tOES < tcyc/2 - tsys-rd - tOES d 

1 6MHz IDT79R3000: tOES <30-10-3 
tOES < 1 7 

20MHz IDT79R3000: tOES <25-8-2 
tOES < 15 

25MHz IDT79R3000: tOES <20-6-2 
tOES < 1 2 

33MHz IDT79R3000: tOES <15-4.5-2 
tOES < 8.5 
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(5) Read-Write !-Cache Data Bus Contention 

This timing requirement ensures that the RAM output is tri-stated soon enough after the instruction 
read signal goes high. In the worst case, when the processor performs a store operation, no data 
contention occurs. 
tRAMHZ < tsys - tRd d + DEn 

16MHz IDT79R3000: tRAMHZ < 16 - 2 + (-2.5) 
tRAMHZ <11.5 

20MHz IDT79R3000: tRAMHZ < 14 - 1 + (-2) 
tRAMHZ <11 

25MHz IDT79R3000: tRAMHZ < 12 - 1 + (-1.5) 
tRAMHZ < 9.5 

33MHz IDT79R3000: tRAMHZ < 9 - 1 + (-1) 
tRAMHZ < 7 

(6) Processor Data Set-Up to End of Write 

This enables the designer to determine whether the cache RAMs have adequate data set-up time 
when the processor does a store operation. In the equation, the minimum derating is used on the write 
line (i.e., tw^i as that is the worst case assumption. 
tRAMDS < tcyc/2 - tsys-smp - tDVal - tDVaf - tWr d 

16MHz IDT79R3000: tRAMDS < 30 - 10 - 3 - 3 - (-2) 
tRAMDS < 16 

20MHz IDT79R3000: tRAMDS < 25 - 8 - 3 - 2 - (-1) 
tRAMDS < 13 

25MHz IDT79R3000: tRAMDS < 20 - 6 - 2 - 2 - (-1) 
tRAMDS < 11 

33MHz IDT79R3000: tRAMDS < 15 - 4.5 - 2 - 2 - (-1) 
tRAMDS < 7.5 

(7) Data Hold from End of Write 

This parameter requirement guarantees that the data hold from end of write of the cache RAM is met 
when the processor or the read buffer is writing to the RAMs. 
tRAMHD < tsmp-rd + tRAMLZ 

16MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

20MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

25MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 

33MHz IDT79R3000: tRAMHD <0 + 2 
tRAMHD < 2 
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(8) Data Set-Up to SysClk 

This timing parameter ensures that the set-up time into an external register (for the main memory 
interface) is sufficient enough for the case w hen the p rocessor is doing a store. The data is clocked in 
the register on the rising edge of the buffered SysOut (through an inverting IDT54/74FCT240A). In this 
equation, tsys(min) d is used to ensure worst case calculations. 
tsetupSys < tcyc/2 - tsys - tDVal - tDVal d + tsys d + t240PDmin 

1 6MHz IDT79R3000: tSetUpSys < 30 - 1 6 - 3 - 3 + 2 + 1 .5 
tSetUpSys < 11.5 

20MHz IDT79R3000: tSetUpSys <25-12-3-2 + 1+1.5 
tSetUpSys < 10.5 

25MHz IDT79R3000: tSetUpSys <20-12-2-2 + 1+1.5 
tSetUpSys < 6.5 

33MHz IDT79R3000: tSetUpSys <15-9-2-2 + 1+1. 5 
tSetUpSys < 4.5 

(9) Data Hold from SysClk 

This timing parameter is to guarantee that the hold time specification for an external register is met 
on a processor store. In this equation the minimum value of tRD d is taken to ensure worst case numbers. 
tHoldSys < tsys-rd - tsys d - t240PDmax + tRAMLZ + tRd d 

16MHz IDT79R3000: tHoldSys < 6 - 2 - 4.8 + 2 + 1 
tHoldSys < 2.2 

20MHz IDT79R3000: tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys < 3.2 

25MHz IDT79R3000: tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys < 3.2 

33MHz IDT79R3000: tHoldSys < 4.5 - 1 - 4.8 + 2 + 1 
tHoldSys < 1 .9 

(10) Address Set-Up to End of Write: 

This equation enables us to determine the timing requirement for the RAM so that the address set- 
up time is sufficient before the trailing edge of the write pulse. 
tRAMAW < tcyc - tsmp-sys - tAdrLo d + tWr'* 

16MHz IDT79R3000: tRAMAW < 60-10-3.5 + 2 
tRAMAW < 48.5 

20MHz IDT79R3000: tRAMAW <50-8-2.5 + 1 
tRAMAW < 40.5 

25MHz IDT79R3000: tRAMAW <40-6-2.5 + 1 
tRAMAW < 32.5 

33MHz IDT79R3000: tRAMAW < 30 - 4.5 -2.5 + 1 
tRAMAW < 24 
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(11) Write Hold Pulse Width: 

This requirement guarantees that the cache RAMs minimum write pulse width specification is met. 
tFlAMPW < tcyc/2 - tWrDly 


16MHz IDT79R3000: tRAMPW <30-5 
tRAMPW < 25 

20MHz IDT79R3000: tRAMPW <25-4 
tRAMPW < 21 

25MHz IDT79R3000: tRAMPW <20-3 
tRAMPW < 17 


33MHz IDT79R3000: tRAMPW <15-2 
tRAMPW < 13 


From the above calculations and Figure 1 5, it can be seen 
that the data set-up to the processor is met. The output enable 
of the RAM , which is controlled by I Rd, goes high and the RAM 
output starts to go tri-state. From the figure, the reader may 
correctly question whether the hold time requirements of the 


IDT79R3000 are met. They are indeed met by the capacitance 
on the bus and also because CMOS devices are being used. 
The technical note entitled Meeting Bus Hold for the 
IDT79R3000 gives a more detailed explanation. 
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Figure 17. Address Setup and Hold Timing for a Latched RAM (25MHz IDT79R3000) 
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From Figure 1 7, it is clearly seen that the address setup and 
hold time forthe latched RAMs are met by using ICIk to capture 
the instruction address. Figure 1 7 is to illustrate the timings for 


a 25MHz IDT71586 latched RAM. Similar timing diagrams 
can be drawn to verify the setup and hold times for I DT79R3000 
operating at different frequencies. 
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Legend: 

tRAMAA - RAM Access Time 

tRAMOE - RAM Output Enable Time 

tRAMHZ - RAM OutPut Low Impedance to Output in High Impedance 

tRAMLZ - RAM Output in High Impedance to Output in Low Impedance 

tFiAMHD - RAM Data Hold Time 

tDS - IDT79R3000 Data Set-up Time 

tsys - Phase Difference between Clk2xSys and Clk2xPhi 

trd - Phase Difference between Clk2xPhi and Clk2xRd 

tsmp - Phase Difference between Clk2xPhi and Clk2xSmp 

tcyc - Cycle time of the IDT79R3000 

tsmp-rd = tsmp - trd 

t240PD - Propagation delay from Clk to Output of IDT54/74FCT240A 


READ CYCLE TIMING SPECIFICATIONS 


Parameter 

Description 

16.7MHZ 

20.0MHz 

25.0MHz 

33.0MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

tRC 

Read Cycle 

35 

— 

30 

— 

25 

— 

15 

— 

tCH 

ALEN High 

10 

— 

10 

— 

10 

— 

8 

— 

tCL 

ALEN Low 

10 

— 

10 

— 

10 

— 

8 

— 

tAS 

Add. Latch Set-Up 

5 

— 

5 

_ 

5 

— 

4 

— 

tAH 

Add. Latch Hold 

5 

— 

5 

— 

5 

— 

4 

— 

tAA 

Address Access 

— 

35 

— 

30 

— 

24 

— 

15 

tACE 

Chip Enable Access 

— 

35 

— 

30 

— 

24 

— 

15 

tOE 

Output Enable 

— 

16 

— 

14 

— 

11 

— 

8 

tCLZ 

CE to Out in LZ 

3 

— 

3 

— 

3 

— 

3 

~ 

tOLZ 

CE to Out in LZ 

2 

— 

2 

— 

2 

— 

2 

— 

tCHZ 

CE to Out in HZ 

— 

25 

— 

22 

— 

20 

— 

15 

tOHZ 

CE to Out in HZ 

— 

11 

— 

11 

— 

9 

— 

7 

tOH 

Output Hold from Address Change 

3 

— 

3 

— 

3 

— 

3 

— 
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Table 7. Read Cycle Timings for an IDT Static RAM with Latches 
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WRITE CYCLE TIMING SPECIFICATIONS 


Parameter 

Description 

16.7MHZ 

20.0MHz 

25.0MHz 

33.0MHz 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

twc 

Write Cycle 

35 

— 

30 

— 

25 

— 

15 

— 

tCH 

ALEN High 

10 

— 

10 

— 

10 

— 

8 

— 

tCL 

ALEN Low 

10 

— 

10 

— 

10 

— 

8 

— 

tAS 

Add. Latch Set-Up 

5 

— 

5 

— 

5 

— 

4 

— 

tAH 

Add. Latch Hold 

5 

— 

5 

— 

5 

— 

4 

— 

tAW 

Add. to End-of-Write 

35 

— 

30 

— 

25 

— 

15 

— 

tASW 

Add. Set-Up 

0 

— 

0 

— 

0 

— 

0 

— 

tWP 

Write Pulse Width 

25 

— 

20 

— 

17 

— 

11 

— 

tew 

CE to End-of-Write 

25 

— 

20 

_ 

20 

— 

11 

— 

tWR 

Write Recovery 

0 

— 

0 

— 

0 

— 

0 

— 

tWHZ 

Write to Out in HZ 

— 

15 

— 

15 

— 

13 

— 

8 

tDW 

Data Set-Up 

16 

— 

13 

— 

11 

— 

7 

— 

tDH 

Data Hold 

0 

— 

0 

— 

0 

— 

0 

— 

tow 

Out Active from End-of-Write 

5 

— 

5 

— 

5 

— 

5 

— 
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Table 8. Write Cycle Timings for an IDT Static RAM with Latches 
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INTRODUCTION 

System design involves trade-offs between several 
components to achieve the desired price/performance ratio. 
Existing development tools of the IDT79R3000 enable the 
designer to measure and compare the price/performance of 
various configurations. cache2000, Pixie and Pixstats are 
software tools that allow the designer to analyze the 
performance of the simulated IDT79R3000 system by 
executing a designer’s application program on the proposed 
system. Pixie and Pixstats are part of the RISC/os, while 
cache2000 is distributed with the System Programmer's 
Package (SPP). SPP is a set of development tools that 
include monitors, a standalone C compiler, a standalone I/O 
library, local and remote debugging tools, downloading software 
via RS232 and Ethernet, cache2000 that simulates memory 
subsystem, and complete system simulation software called 
SABLE. SABLE simulates IDT79R3000/3001 instructions, 
caches, TLB behavior, a disk drive and a DU ART for a simple 
console interface. The simulation results of the cache2000 
and Pixstats provide the designer with all the information 


needed to determine the best possible price/performance 
solution. 

MEMORY SUBSYSTEM 

To meet the high processing speed of the MIPS RISC 
processors, the memory subsystem (Figure 1) is usually 
structured into a hierarchy of small high-speed cache memory, 
read/write buffers and large, slow main memory. The cache 
memory is typically made up of an instruction cache and a data 
cache. These cache memories allow the CPU to fetch one 
instruction and one word of data in every clock cycle. To retire 
the writes to the main memory in one clock cycle, it is common 
to use a write buffer as an interface between the high-speed 
CPU and the slower main memory. The write bufferfunctions 
as a FIFO of multiple levels and has the ability to perform 
special functions to minimize the main memory bus traffic. 
cache2000 allows the user to choose various parameters for 
these three basic blocks of the memory hierarchy and then 
analyze the performance of the proposed memory system. 



Figure 1. Typical Memory Subsystem IDT79R3000 


2854 drwOI 


cache2000 

cache2000 is a software tool that simulates a proposed 
IDT79R3000/3001 memory subsystem. It analyzes the 
memory references made by an application program during its 
execution and generates various statistics about its dynamic 
behavior. cache2000 determines the execution time taken by 
the user’s application program by simulating the penalties 


involved in accessing the memory. But, it does not take into 
account any interlock cycles of the CPU or the FPA. These 
interlock cycles can be determined from the output of Pixstats. 
The main memory model simulated is Page mode, and the 
latencies associated are changeable. By simulating different 
memory subsystems with the cache2000, the user can 
determine the performance of the application program on 
those systems and can arrive at an optimal solution. 
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PIXIE 

Pixie is a Unix program that generates a new executable of 
the user’s code by adding instructions that trace pieces of 
code known as basic blocks. This new executable is referred 
to as pixified executable. A basic block is a sequence of 
instructions with a single entry point and a single exit point. 
Once processing starts at the entry point, it will proceed until 
the exit point without branching. Along with the modified 
executable, Pixie will also generate a file (with suffix Addrs) 
that contains the addresses of these basic blocks. When the 
pixified executable is run (by typing its name), a new file with 
the execution count of each basic block (with suffix Counts) is 
generated. Programs like cache2000 and Pixstats use the 
Addrs and Counts files, along with the pixified executable, to 
generate formatted profile information. When using cache2000, 
the user’s program must be pixified with -idtrace option to 
generate memory referencing information of instructions and 
data. When using Pixstats, the user’s program need not be 
pixified with any option. 

PIXSTATS 

Pixstats is also a Unix program that analyzes the program’s 
execution characteristics. Itgenerates statistics regardingthe 
opcode frequencies, interlocks and a mini-profile of the program 
execution. To run Pixstats, the user’s program needs to be 
pixified (running Pixie) without any option. To generate the 
Counts file, the pixified program must be executed. Pixstats 
assumes that the user’s program fits into the caches and, 
therefore, does not add any memory penalties. The integer 
multiply/divide interlock cycles and the floating point interlocks 
listed in the output of Pixstats needs to be added to the number 
of cycles given by cache2000 to accurately determine the 
total number of cycles the user’s program takes for execution. 
This execution time is with an FPA in the system, as the 
IDT79R3000/3001 compilers assume the presence of an FPA 
in the system at the time of code generation. If FPA does not 
exist, the FPA instructions are emulated in software. Every 
floating point instruction will cause an exception and the 
exception handlerwill invoke the emulating software. A listing 
of the emulation overhead is provided in the following pages. 
The overhead and the number of the floating point accelerator 
instructions contained in the Pixstats output will help the 
designer in estimating the overhead of a system without an 
FPA. 

MODELING MEMORY SUBSYSTEMS WITH 
CACHE2000 

To simulate different memory subsystems, cache2000 is 
modified to the desired parameters of the proposed system. 
Creating a cache2000 model to simulate the desired memory 
subsystem can be done either by editing the source file 
“cache2000.c", or defining the parameters at Unix command 
level. Allthe possible changeable parameters can be assigned 
new values when the source file is edited. Once the source file 
is edited, it should be compiled to create the executable of 


cache2000. To compile cache2000.c , an include file called 
“trace. h" must be in the current directory. Command level 
modifications can be done at compilation time or at execution 
time of cache2000. The parameters of the caches can be 
changed by redefining them when invoking the compiler. The 
main memory latencies and the write buffer depth can be 
altered at the time of execution. (A complete list of all the 
options of cache2000, Pixie and Pixstats is enclosed at the 
end of this application note.) The options include: 

(a) Editing the source file cache2000.c. — Invoke an editor (vi 
or emacs) on cache2000.c and modify any changeable 
parameter. The list of changeable parameters is given at 
the end. Save the changes and compile the source file 
with the C compiler. To compile cache2000.c, the file 
trace. h should also be present in the current directory. 
The highest level of optimization (04) is used to produce 
the most optimal executable. The math library is linked by 
using -Im. Once cache2000.c is modified with all the 
required parameter values, there is no need to use any 
options either at compile time or at run time. 

cc -04 -o cache2000 cache2000.c -Im 

(b) Making changes to the Cache2000 at compilation time by 
redefining the parameters — 

cc -04 -o cache2000 -DLsizeJog=12 -Dd_sizejog=10 
cache2000.c -Im 


This compiles a cache2000 that models an instruction 
cache of 4K words and data cache of 1 K words. The compile 
time changeable parameters of cache2000 are: 


Parameter Name 

i_refill_log 

Lsizejog 

d_refill_log 

d_size_log 

byte _ gathering 

read_conflict_check 

istreaming 


Possible Values 

0,2, 4, 5 

10,11,12,13,14,15,16 
0,2, 4, 5 

10,11,12,13,14,15,16 

0,1 

0,1 

0,1 


(c) Setting the parameters at the execution time of cache2000 
— The memory latencies can be set at the time of 
simulation. cache2000 is executed in parallel with the 
pixified executable of the user’s application program. The 
Unix command “makepipe” is used to run these two 
processes ( cache2000 and user’s program) in parallel. 
The user’s program (pixified with -idtrace), when executed, 
will generate memory referencing information to a Unix 
file with file descriptor 19. This information is piped (by the 
makepipe command) to the standard input file (file 
descriptor 0). cache2000 reads the standard input and 
proceeds with the memory subsystem simulation. An 
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example of setting up the read latency to 6 cycles is: 

makepipe19my_prog.pixie‘l’0cache2000-read_latency 
6 > my_prog.cache2000 

Now cache2000 models a main memory with a read 
memory latency of 6 cycles. 

STEPS IN PERFORMANCE ANALYSIS 

The steps involved in measuring the performance of the 
IDT79R3000/3001 -based systems are: 


makepipe 19 my _prog. pixie '/' 0 cache2000 -readjatency 
6> my _prog.cache2000 

(5) Run Pixstatstofindoutthe interlock cycles of IDT79R3000/ 
3001 integer multiply/divide unit and the interlock cycles 
of the FPA IDT79R3010. These interlock cycles should 
be added to the cycles from the cache2000 output. The 
output of Pixstats also contains the IDT79R3000/3001 
opcode distribution. This provides the designer with the 
percentages of the FPA instructions in the application 
program. Running Pixstats is a three step process. The 
steps are: 


(1) Compile the source of the application program with the 
desired level of optimization to create the executable 
module. 

cc -04 -o my_prog my_prog.c 

(2) Create a cache2000 executable to model the proposed 
memory subsystem by compiling the cache2000.c. 

cc -04 -o cache2000 -Di_sizeJog=12 -Dd_sizejog=10 
-Di_refill_log= 5 -Distreaming= 1 cache2000.c -Im 

This compiles a cache2000 with instruction cache of 4K 
words, data cache of 1 K words and 32 words of block refill 
for instruction cache with instruction streaming enabled. 

(3) Pixify the application program with -idtrace. 

pixie -idtrace -o my jerog. pixie my _prog 

The above command pixifies the application program 
my_prog and places the output in my_prog. pixie. 

(4) Use makepipe to run the pixified application program and 
Cache2000 in parallel. 

makepipe 19 my_prog. pixie 0 cache2000 > 
my jprog.cache2000 


(a) Repixify the application program without any option. 

(b) Run the pixified program by typing in the name of the 
pixified output to generate the basic block counts file. 

(c) Run Pixstats. 

The commands to accomplish these tasks are: 

(d) Pixify the application program. 

pixie -o my_prog. pixie myprog 

This generates my_prog.Addrs file containing the 
basic block addresses. 

(e) Run the pixified program 
my jprog.pixie 

This generates my_prog. Counts file containing the 
execution count of the basic blocks. 

(f) Run Pixstats. 

pixstats my_prog > my _prog. pixstats 

Pixstats uses the Addrs and Counts files and redirects 
the output to myjarog. pixstats. 


This command pipes the memory references generated 
by my __prog. pixie to the cache2000. The statistical 
information produced by cache2000\o the standard output 
can be redirected to my_prog.cache2000. Arguments, 
input file, output file for the user’s application program can 
be specified by: 

makepipe 19my_prog.pixieargs '<’ input _file’>’ outputjite 
T 0 cache2000 > my_prog.cache2000 

By running the makepipe in the background, the designer 
can fire up more than one simulation. The following 
command sets up the read latency from the main memory 
to 6 cycles and runs the simulation in the background. 


cache2000 OUTPUT 


The cache2000 program prints out the statistical information 
to the standard output file. It can be redirected to any desired 
file. The statistical information is generated every time after 
the execution of user-specified number of cycles. By setting 
the print variable to an extremely large number (20000000000) , 
the output generation can be restricted to the final results. 

The following is one such output file for a typical image 
processing application program written in C language. The 
executable size is 500Kbytes, the text segment is 300Kbytes 
and data segment is 200Kbytes consisting mainly of 
uninitialized data items of size greater than 512 bytes. The 
cache2000 models a memory subsystem of 8Kbytes of 
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instruction cache, 8Kbytes of data cache, 4 words of block 
refill size for the l-cache and 1 word refill for the D-cache. 
Instruction streaming is enabled. The cache flushes are 
turned off to model an embedded application. The clock 


speed is 16.67MHz. The Page mode main memory read 
latency is 5 cycles, giving an access time of 300ns. 

Lines in the file are numbered in the first column for 
referencing in the explanation. 


1 cache2000 -flush 20000 -cycle 60 -clock 1 6.67 -wbsize 1 -readjatency 5: 

2 Fri Oct 13 21:39:05 1989 

3 73398701 cycles (1 .569), 4.4s @ 1 6.7MHz 

4 46783524 instructions (46.8M) 

5 0 cache flushes 


6 

word read 

9008972 

19.26% 

7 

double read 

0 

0.00% 

8 

word write 

4161532 

8.90% 

9 

double write 

0 

0.00% 

10 

byte write 

445222 

0.95% 

11 

half write 

438867 

0.94% 

12 

swr 

0 

0.00% 

13 

swl 

73 

0.00% 

14 

Iwcl 

255500 

0.55% 

15 

Idcl 

0 

0.00% 

16 

swcl 

5635 

0.14% 

17 

sdcl 

0 

0.00% 

18 

basic block 

8615556 

18.42% 

19 

(null) 

0 

0.00% 


20 l-cache size = 2048 words, direct-mapped, 4 word refill 

21 D-cache size = 2048 words, direct-mapped, 1 word ref ill, write-through 

22 Write buffer = 1 deep, conflict checking on d-miss, byte gathering 

23 TLBsize = 56 entries, associative, random replacement, 
page size =1024words 


per per per 

instr cycle other 


24 

uTLB misses: 

318004 

(0.68%/ 0.43%) 

25 

l-TLB misses: 

9394 

(0.02%/ 0.01%) 

26 

D-TLB misses: 

7927 

(4.29%/ 0.01%/ 0.06%) 

27 

l-cache misses: 

2007056 

(0.01%/ 2.73%) 

28 

D-cache misses: 

234278 

(0.50%/ 0.32%/ 2.53%) 

29 

Idle writes: 

2171982 

(4.64%/ 2.96%/ 42.5%) 
(5 memory cycles) 

30 

Page mode writes: 

2561693 

(5.48%/ 3.49%/ 50.1%) 
(2 memory cycles) 

31 

Non-page writes: 

377581 

(0.81%/ 0.51%/ 7.4%) 
(6 memory cycles) 

32 

Total writes: 

5111256 

(10.93%/ 6.96%) 

33 

Write merges 

0 

(0.00%/ 0.00%/ 0.00%) 

34 

l-stream branch: 

351599 

(0.75%/ 0.48%/ 17.5%) 

35 

l-stream d-miss: 

31036 

(0.07%/ 0.04%/ 1.5%) 

36 

l-stream write: 

252829 

(0.54%/ 0.34%/ 12.6%) 

37 

l-stream block: 

1371592 

(2.93%/ 1.87%/ 68.3%) 

38 

l-stream words: 

0.4/2. 1/1. 6 
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39 

uTLB miss cycles: 

318004 

( 0.68%/ 0.43%) (penalty 1) 

40 

l-TLB miss cycles: 

122122 

(0.26%/ 0.17%) (penalty 13) 

41 

D-TLB miss cycles: 

103051 

(0.22%/ 0.14%) (penalty 13) 

42 

l-cache miss cycles: 

18063504 

(38.61%/ 24.61%) (penalty 9) 

43 

l-cache streaming: 

-4155427 

(-8.88%/ -5.66%) 

44 

D-cache miss cycles: 

1405668 

(3.00%/ 1.92%) (penalty 6) 

45 

2-cycle SB/SH/SWL/SWR : 

884162 

(1.89%/ 1.20%) (penalty 1) 

46 

Write buffer full cycles: 
(average 1 .3 per write) 

6780214 

(14.49%/ 9.24%) 

47 

Write wait cycles: 

(average 1 .4 per miss) 

3093879 

(6.61%/ 4.22%) 

48 

Subtotal: 

26615177 

(56.89%/ 36.26%) 

49 

Instructions: 

46783524 

(100.00%/ 63.74%) 

50 

Total: 

73398701 

(156.89%/ 100.00%) 

51 

TLB memory cycles: 

111030 

(0.24%/ 0.15%) 

52 

Memory bus cycles: 

33976968 

(72.63%/ 46.29%) 

53 

simulation runtime: 

248.4U 23.4s, 5473.6w/5%, 172 Kinst/S, 255 KrefS/S 

54 

DONE! 




EXPLANATION OF FIELDS 

cache2000 Runtime Parameters 

1 cache2000 -flush 20000 -cycle 60 -clock 16.67 -wbsize 1 -readjatency 5: 

2 Fri Oct 13 21:39:05 1989 

All the cache2000 runtime parameters that are set by the set the cache flush interval. By choosing a large number (like 
user are listed in this line. The cache flush interval is set to 20 billion) for this variable, this flushing can be turned off for 
20,000 million cycles. This implies that the caches are flushed embedded applications that do not run Multiprogramming/ 
for every 20 billion cycles of user’s program execution. In Multitasking software. The clock speed selected by the user 
Multiprogramming/Multitasking systems, on context-switching, is 16.67MHz and, therefore, the cycle time is 60 nanoseconds, 
implicit cache flushing occurs due to a series of cache misses. The write buffer simulated is 1 word deep. The Page mode 
These cache misses are equivalent to caches being flushed, main memory read latency on load misses is 5 cycles or 300 
To simulate that environment, cache2000a\\ows the user to nanoseconds. 

Cycles and Instructions 

3 73398701 cycles (1.569), 4.4s @ 16.7MHz 

4 46783524 instructions (46. 8M) 

The user’s application program takes 73398701 cycles for this number approaches one. The clock speed can be set to 
complete execution. To execute these cycles, the processor the desired frequency with -clock option at execution time of 
takes 4.4 seconds when running at 16.67MHz. It takes an the cache2000. Line 4 gives the number of instructions 
average of 1 .569 cycles per instruction for simulation of the executed. The last field in line 4 is the number of instructions 
current memory subsystem and of the current user’s application in millions. The number of instructions is independent of the 
program. As we change the parameters for the memory memory system being simulated. It only depends on the 
subsystem, notice that this average number of cycles per user’s program, the data files the program is using and the 
instructionwillalsochange. Forafine tuned memory subsystem optimization level used in compiling the program. 

Cache Flushes 

5 0 cache flushes 

In the current simulation, the caches are flushed for every are zero for this example, as the program does not execute 20 
20,000 million instructions to simulate an embedded application billion instructions, 
environment without any context switching. The cache flushes 
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Reads 

6 word read 9008972 19.26% 

7 double read 0 0.00% 


These two lines give the number of read instructions and 
their percentage of the total number of instructions. In the 
current simulation, 19.26% of the total instructions are word 
(32 bits) load instructions. For cached data/instructions, 
IDT79R3000 treats all load instructions as load word 


instructions since a word is read from the caches on cache 
hits. On a cache miss, a word or multiple words are read from 
the memory. The double reads on line 7 are not applicable to 
the IDT79R3000. 


Writes 


8 

word write 

4161532 

8.90% 

9 

double write 

0 

0.00% 

10 

byte write 

445222 

0.95% 

11 

half write 

438867 

0.94% 

12 

swr 

0 

0.00% 

13 

swl 

73 

0.00% 


These lines list all the store instructions executed by the 
processor. All the load/store instructions work only with the 
data cache if the load/store address is cached. In this 
example, almost 9% of the stores are store word instructions. 
This is almost half of the word read percentage, which is 
typical of many applications. Double writes are not applicable 
to the IDT79R3000. For partial word writes (sb,sh,swr,swl), 


the IDT79R3000 does a read-modify-write operation. The 
processor does a load from the cache (on cache hit) at the 
store address, merges the data to be stored with the data 
loaded and writes the result back to cache. On a cache miss 
for the load, the partial word is written to the main memory 
(through a write buffer), leaving the cache untouched. 


FPA (cpl) Read and Writes 


14 

Iwcl 

255500 

0.55% 

15 

Idcl 

0 

0.00% 

16 

swcl 

65635 

0.14% 

17 

sdcl 

0 

0.00% 


The number of reads and writes to/from the FPA instructions are executed by the IDT79R3000. Forthecurrent 
(coprocessor 1 ) are given by the above four lines. The double user’s program, there are 255500 FPA load instructions. The 
word load and store (Idcl.sdcl) are not applicable to the percentage of these loads to the total number of instructions 
IDT79R3000. The coprocessor load/store instructions are is 0.55. The number of store instructions is 65635 and 0.1 4 is 
word operations. The CPU and the coprocessors are tightly the percentage to the total number of instructions, 
coupled as they share the same data bus. The Iwcl and swcl 

Basic Blocks 

18 basic block 8615556 18.42% 

A basic block is a sequence of instructions with a single jump/branch instruction and exit points are addresses holding 
entry point and a single exit point. Once execution starts at the a jump/branch instruction. The basic block count on line 18 
entry point instruction, it proceeds sequentially until the exit indicates the number of basic blocks of the user’s application 
point instruction. Entry points are the target addresses of a program executed. 

Anulled Instructions 

19 (null) 0 0.00% 

Not applicable for the IDT79R3000. 
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Instruction Cache 

20 l-cache size = 2048 words, direct-mapped, 4 word refill 

21 D-cache size = 2048 words, direct-mapped, 1 word refill, write-through 


The instruction cache and the data cache for the 
IDT79R3000 are direct-mapped. For stores, the data is 
written to the main memory through the data cache, 
implementing the write-through policy. Forpartia! word stores, 
the data cache is looked up for a cache hit. If it misses, the 
partial word is written to the main memory leaving the cache 
unchanged. The l-cache size, D-cache size, l-cache refill size 


and D-cache refill size can be chosen by the user. In the 
current simulation, the caches are 2048 words deep (8Kbytes) 
and the block refill sizes for l-cache and D-cache are 4 words 
and 1 word, respectively. These parameters can be set at 
compilation time of the Cache2000. The possible cache sizes 
are 4, 8, 1 6, 32, 64 , 1 28 and 256 Kbytes. The block refill sizes 
can be selected as 1 , 4, 8, 1 6 or 32 words. 


Micro TLB Misses 

24 uTLB misses: 318004 (0.68%/ 0.43%) 


In translating the virtual address of an instruction to its 
physical address, a two-entry table called uTLB (micro TLB) 
is employed. This micro TLB is not used for data references. 
If a uTLB miss is encountered, the TLB (56 entries) is looked 
up. Whenever there is a uTLB miss, its least recently used 


entry is updated with the appropriate entry from the main TLB. 
The percentage of uTLB misses to the total number of 
instructions and the percentage of the misses to the total 
number of cycles are given in paranthesis. 


TLB Misses 

25 l-TLB misses: 9394 (0.02%/ 0.01%) 

26 D-TLB misses: 7927 (0.02%/ 0.01%/ 0.06%) 


The above two lines account for the number of times the 
mapping information needed ( a TLB entry) to translate the 
virtual address to physical is missing from the TLB. The first 
line gives the number of misses for instructions while the 
second one accounts for the data references. The main 
memory holds the entire page table and on a TLB miss, this 


page table is looked up in order to update the TLB. The TLB 
is a fully associative memory and a random replacement 
algorithm is employed for TLB updates from the page table. 
The last number in paranthesis for D-TLB misses is the 
percentage of the D-TLB misses to the total number of load 
and store instructions. 


Cache Misses 

27 l-cache misses: 2007056 (4.29%/ 2.73%) 

28 D-cache misses: 234278 (0.50%/ 0.32%/ 2.53%) 


The l-cache misses are the number of times the processor 
referred to the instruction cache and could not find the 
instruction. The number of misses on line 27 indicates that the 
2Kwords of l-cache are not quite sufficient for 100% hit rate. 
Increasing the cache size might enhance the performance. 
The first percentage, 4.29%, is the l-cache miss rate, making 
the l-cache hit rate 95.71%. The D-cache misses are the 
number of attempts the processor made to read from the data 


cache but could not locate the word. The data cache miss rate 
is given by the last percentage in paranthesis (2.53%) which 
is the percentage of D-cache misses to the number load word 
instructions (Iw) and the number of load word instructions to 
the coprocessor 1 (Iwcl). The data cache misses, however, 
do not include the load misses occuring in read-modify-write 
operations of partial word stores. 
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Main Memory Writes 


29 

Idle writes: 

2171982 

(4.64%/ 2.96%/ 42.5%) 
(5memory cycles) 

30 

Page mode writes: 

2561693 

(5.48%/ 3.49%/ 50.1%) 
(2 memory cycles) 

31 

Non-page writes: 

377581 

(0.81%/ 0.51%/ 7.4%) 
(6 memory cycles) 

32 

Total writes: 

5111256 

(10.93%/ 6.96%) 

33 

Write Merges 

0 

(0.00%/ 0.00%/ 0.00%) 


cache2000 simulates Page mode main memory. All the 
different types of writes to the main memory are listed here. 
Idle writes are writes to the main memory that do not follow or 
precede another write. Page mode writes are successive 
writes to the same page. Non-page writes are successive 
writes to different pages. Total writes are the summation of 
these three types of writes. The third percentage in the first set 
of parentheses on lines 29 through 31 is the percentage of the 
correspondingwritetothetotal writes. The number of memory 
cycles assumed in the current simulation to perform these 


writes is given in the second set of parentheses. These cycles 
can be specified by the user at run time of cache2000. Write 
merges involve merging the words in the write buffer if the 
words have the same address. These write merges are 
simulated by Cache2000 if byte gathering is turned on. Byte 
gathering is one of the features of the write buffer, IDT79R3020. 
Write merges reduce the number of references made to the 
main memory for data updates. In this simulation with 1 word 
deep write buffer and byte gathering turned on, the number of 
write merges is zero. 


Streaming 

34 l-stream branch: 351599 

35 l-stream d-miss: 31036 

36 l-stream write: 252829 

37 l-stream block: 1371592 

38 l-stream words: 0.4/2. 1/1 .6 


(0.75%/ 0.48%/ 17.5%) 
(0.07%/ 0.04%/ 1.5%) 
(0.54%/ 0.34%/ 12.6%) 
(2.93%/ 1.87%/ 68.3%) 


On every l-cache miss, a block refill number of instructions 
is brought into the instruction cache. Enabling streaming 
allows the CPU to execute these instructions as they get 
written into the instruction cache. In other words, the CPU 
does not have to wait forthe completion of the writing of a block 
of instructions into the instruction cache before it starts the 
execution. Line 37 gives the number of times the CPU 
streamed through a full block. Lines 34 to 36 list the number 
of times the processor dropped out of streaming due to 
branches, data cache misses, write buffer flushes and partial 
word stores. Line 34 gives the number of occasions the 
processor dropped out of streaming because of executing a 
branch/jump instruction. If the CPU is executing a load and 
faces a data cache miss, streaming is aborted; Line 35 
accounts for such cases. The processor also drops out of 
streaming whenever the write buffer is full and the current 
instruction needs to write to the write buffer, or whenever it has 


to perform a read-modify-write operation for partial word store 
instructions, l-stream write on Line 36 gives the number of 
such drop outs. The last line, 38, gives the instruction number 
in the block that caused the l-cache miss (0.4 or the second 
instruction in the block; numbering of the instructions starts 
from 0 in a block), the number of instructions the processor 
streamed through (2.1 ) and the number of instructions of the 
block the processor did not execute (1.6). The numbers on 
Line 38 are average numbers. The last percentage in lines 34 
to 37, under the other column, is the percentage of the 
corresponding streaming number to the number of l-cache 
misses on Line 27. Dropping out of streaming does not stop 
the refilling of the instruction cache with the block refill number 
of instructions. When streaming is aborted, the processor 
waits till the refilling is finished and then starts executing the 
instruction that caused the drop out. 


Micro TLB Miss Cycles 

24 uTLB misses: 318004 (0.68%/ 0.43%) 

39 uTLB miss cycles: 318004 (0.68%/ 0.43%) (penalty 1) 


When the uTLB does not contain the entry to translate the 
virtual address of an instruction to its physical address, there 
is a penalty (1 cycle) in referring the main TLB and updating 
one of its entries with the appropriate entry for future use. The 
miss cycles in Line 39 are a product of the number of uTLB 


misses (318004) with the penalty (1 ) on Line 39. This penalty 
of 1 cycle can not be changed by the user. The percentages 
in parentheses on Line 39 are the penalty cycles per instruction 
and per cycle, respectively. 
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TLB Miss Cycles 

25 I -TLB misses: 9394 

26 D-TLB misses: 7927 

40 l-TLB miss cycles: 122122 

41 D-TLB miss cycles: 103051 


(0.02%/ 0.01%) 

(0.02%/ 0.01%/ 0.06%) 
(0.26%/ 0.17%) (penalty 13) 
(0.22%/ 0.14%) (penalty 13) 


The penalty cycles for loading the main TLB and/or the 
uTLB with a single page table entry from the main memory is 
given by lines 40 and 41. The penalty for every miss (13 
cycles) is fixed and can not be set by the user. These miss 


cycles do not account for any I/D cache miss penalty cycles 
that might arise while executing the TLB refill algorithm. The 
percentages in parentheses on lines 40 and 41 are penalty 
cycles per instruction and per cycle, respectively. 


Cache Miss Cycles 

27 l-cache misses: 2007056 

28 D-cache misses: 234278 

42 l-cache miss cycles: 18063504 

43 l-cache streaming: -4155427 

44 D-cache miss cycles: 1405668 


(4.29%/ 2.73%) 

(0.50%/ 0.32%/ 2.53%) 
(38.61%/ 24.61%) (penalty 9) 
(-8.88%/ -5.66%) 

(3.0%/ 1.92%) (penalty 6) 


The penalty cycles associated with the l-cache misses and 
the D-cache misses are given by lines 42 and 44. The penalty 
for a miss is the sum of read latency of the main memory (user 
selectable) and the block refill size. In the current simulation 
the read latency is set to be 5 cycles and the block refill size 
ischosentobe4wordsforinstructioncacheand 1 wordforthe 


data cache. If instruction streaming is enabled, the penalty for 
l-cache misses is reduced because of concurrent execution of 
the instructions with the refill of the instruction cache. This 
gain in cycles is given in Line 43. The percentages in 
parentheses on lines 42 to 44 are penalty cycles per instruction 
and per cycle, respectively. 


Partial Word Stores 


10 byte write 

445222 

0.95% 

1 1 half write 

438867 

0.94% 

12 swr 

0 

0.00% 

13 swl 

73 

0.00% 

45 2-cycle SB/SH/SWUSWR: 884162 

(1.89%/ 1.20%) (penalty 1) 


Partial word store instructions take two cycles as the 
processor performs a read-modify-write operation, so the 
penalty for each store is one cycle. The total penalty cycles 
due to these instructions is given in Line 45 (884162). The 


number of 2-cycle stores is the sum of all the partial word 
stores listed from Line 10 to 13. The percentages in 
parentheses on Line 45 are the penalty cycles per instruction 
and per cycle, respectively. 


Write Buffer Penalty Cycles 

46 Write buffer full cycles: 6780214 (14.49%/ 9.24%) 

(average 1 .3 per write) 

47 Write wait cycles: 3093879 (6.61%/ 4.22%) 

(average 1.4 per miss) 


Whenever the write buffer is full, the next write causes the 
buffer to be flushed. The processor waits while this flush is 
carried out. For the current user program, the processor 
waited for 6780214 cycles (Line 46) while the main memory is 
being updated with the contents of the write buffer. The 
average wait penalty per write due to a full write buffer is given 
in second set of parentheses on Line 46 (1 .3). The write buffer 
continually updates the main memory whenever it has some 


data and the memory bus is free. When there is an l-cache 
miss or a D-cache miss, the memory bus is busy supplying 
instructions and data holding off the write buffer writes. The 
number of cycles the writes are kept waiting due to reads from 
the main memory on cache misses is given on Line 47. The 
average number in parentheses is the write wait cycles per I- 
and D-cache misses. 
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Subtotal 

48 Subtotal: 26615177 (56.89%/ 36.26%) 

The subtotal is the total number of penalty cycles minus the penalty cycles to the total instructions and the percentage of 
cycles gained due to streaming. The percentage of the total the total penalty cycles to the total cycles is given in ( )s. 

Instructions 

4 46783524 instructions (46.8M) 

49 Instructions: 46783524 (100.00%/ 63.74%) 

Line 49 gives the total number of instructions of the current might change if the application program is data dependent or 
application program executed. The number of instructions if a different optimization level is used in compiling the program. 

Total Cycles 

3 73398701 cycles (1.569), 4.4s @ 16.7MHz 

50 Total: 73398701 (156.89%/ 100.00%) 

The total number of cycles the processor takes to execute uTLB/TLB misses, cache misses, 2-cycle instructions, write 
the current userprogram isthesumof the numberof instructions bufferflushing and write buffer waits, minus the cycles gained 
(1 cycle/instr) and the total number of penalty cycles due to due streaming. 

TLB Refill Cycles 

51 TLB memory cycles: 111030 (0.24%/ 0.15%) 

For every TLB miss, a TLB refill algorithm gets executed, into account in l-cache misses and D-cache misses. Line 51 
This kernel code is cached and resides in ksegO. The cache gives the number of cycles spent for such cache misses, 
misses occuring while this algorithm gets executed are taken 

Bus Usage 

52 Memory bus cycles: 33976968 (72.63%/ 46.29%) 

The caches are loaded with instructions and data from the transfers. The number of cycles the bus is in usage is given 
main memory and updates are made to the main memory by on line 52. 
the write buffer keeping the memory bus busy with these 

Cache2000 Runtime 

53 simulation runtime: 248.4u 23.4s, 5473.6w/5%, 172 Kinst/s, 225 Krefs/s 

54 DONE! 

The amount of user time in seconds (248.4), the amount of made per second by the cache2000 program in simulating the 
system time in seconds (23.4), the number of instructions memory model are listed on Line 53. These numbers have no 
executed per second and the number of memory references bearing on the performance of the memory model itself. 
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COMPARISON OF MEMORY SUBSYSTEMS tables. In these simulations, it is assumed that the write butter 
The application program is run through difterent memory * s 1 w °rd deep and the main memory read latency is 5 cycles, 
subsystems and execution times are listed in the following 



l-Cache Size 
(Kbytes) 

D-Cache Size 
(Kbytes) 

1- & D-Cache Refill 
(words) 

Instruction 

Streaming 

Run Time 16.67MHz 
(seconds) 

(1) 

8 

0 

1 

OFF 

9.6 

■M 

8 

0 

4 

ON 

8.3 


8 

0 

8 

ON 

8.1 


8 

8 

1 

OFF 

5.9 


8 

8 

4 

ON 

4.4 

■EM 

8 

8 

8 

ON 

4.2 

m 

16 

8 

1 

OFF 

5.1 

ill 

16 

8 

4 

ON 

4.1 

■M 

16 

8 

8 

ON 

4.0 

(10) 

16 

6 

1 


5.1 

(11) 

16 

16 

4 


4.1 

(12) 

16 

16 

8 


4.0 

(13) 

32 

16 

1 

OFF 

4.3 

(14) 

32 

16 

4 

ON 

3.7 

(15) 

32 

16 

8 

ON 

3.7 

(16) 

32 

32 

1 

OFF 

4.2 

(17) 

32 

32 

4 

ON 

3.7 

(18) 

32 

32 

8 

ON 

3.6 

BEH 

64 

32 

1 

OFF 

3.7 

B331 

64 

32 

4 

ON 

3.5 

■ 

64 

32 

8 

ON 

3.5 


64 

64 

1 

OFF 

3.7 

■ml 

64 

64 

4 

ON 

3.5 

Ball 

64 

64 

8 

ON 

3.5 


2854 tbIOl 


Table 1. 


Notice that when streaming is on, doubling the instruction 
cache from 8Kbytes to 1 6Kbytes gives 6% gain in perform- 
ance, 1 6Kbytes to 32Kbytes gives 9.5% gain and 32Kbytes to 
64Kbytes gives 4.2% gain. When streaming is off, doubling 
the instruction cache gives an average 16% gain in perform- 
ance. Also notice that changing the data cache size from 0 to 
8Kbytes gives an average gain of 90%. 

In the following table, the simulations assume a read 
latency of 3 cycles and a 1 word deep write buffer. Instruction 
streaming is turned on. 


■ 

1 Cache 
Size 

(Kbytes) 

D Cache 
Size 

(Kbytes) 

1 &D Cache 
Refill 
(words) 

Run Time 
20.00MHz 
(seconds) 

(D 

32 

8 

1 

3.4 

(2) 

32 

8 

4 

3.0 

(3) 

32 

32 

1 

3.3 

(4) 

32 

32 

4 

3.0 


2854 tbl 02 


Comparing cases 1 and 2, observe that block refill of 4 
words enhanced the performance by 1 3.3%. Between 3 and 
4, the gain is 10%. With 1 word refill, incrementing the data 
cache to 32Kbytes gives a 3% gain. Between 2 and 4, refill 
size contributes little to the overall performance. 

In the following simulations, refill size of both I- and D- 
caches is 1 word. The read latency is 5 cycles. 



1 Cache 
Size 

(Kbytes) 

D Cache 
Size 

(Kbytes) 

Write Buffer 
Depth 
(words) 

Run Time 
20.00MHz 
(seconds) 

mm 

4 

0 

1 

9.1 

131 

4 

0 

4 

8.9 


2854 tbl 03 


Table 3. 


Write buffer, for this application program, gives a boost of 
2.25% to the performance. 



Table 2. 
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PIXSTATS OUTPUT 

The following listing is the output of the Pixstats program. 
The information that is not relevant to performance analysis is 
omitted from the output file. The interlock cycles of the integer 
mult/div of the IDT79R3000 and the interlock cycles of the 
IDT79R3010 add/mult/div units are listed in the Pixstats 
output. These interlock cycles should be added to the number 
of cycles given in the cache2000 output to obtain the exact 
number of cycles for program execution, sin cecache2000 
only evaluates the memory penalties and does not take into 
account of any interlock penalties. Ontheotherhand, Pixstats 
assumes that the application program fits completely in the 
caches. Pixstats is run only once because it is independent of 
the memory subsystem. The number of interlock cycles that 
should be added to the cache2000 cycles is the difference 
between the number of cycles given on the first line and the 
number of instructions on the second line of Pixstats output. 

The Pixstats output file is interspersed with comments for 
explanation. The output file follows. 

pixstats embedded_application: 

49268275 (1 .053) cycles (2.95s @ 1 6.67MHz) — This line 
gives the number cycles the IDT79R3000 and IDT79R3010 
take to execute the application program. The number of 
cycles calculated by Pixstats can differ from the cycles 
calculated by cache2000. In calculating the cycles, Pixstats 
assumes that the application program instructions and data 
completely reside in the caches. In other words, Pixstats 
assumes 1 00% cache hit rate for both I- and D-caches, while 
Cache2000 accounts for the memory overhead. On the other 
hand, pixstats estimates the interlock cycles of the I DT79R3000 
and IDT79R3010. The interlock cycles are not taken into 
account by cache2000. The number in the first set of ( )s is the 
average number of cycles per instruction. The execution time 
is 2.95s at 16.67MHz. 

46783524 (1.000) instructions — This line gives the number 
of instructions executed. Both Pixstats and cache2000 execute 
the same number of instructions. 

9264472 (0.198) loads — This line gives the total number 
of loads to the IDT79R3000 and IDT79R301 0. This is the sum 
of loads on lines 6 and 14 of cache2000 output. 

5111 329 (0.1 09) stores — This line gives the total number 
of stores from the IDT79R3000 and IDT79R301 0. This is the 
sum of stores on lines 8,10,11,12,13, and 16 of Cache2000 
output. 


14375801 (0.307) loads+stores — These are the memory 
referencing instructions executed by the IDT79R3000. 

14558230 (0.311) data bus use — The number of cycles 
the data bus is busy is given here. 

6406647 (0.137) branches. 

70241 33 (0. 1 50) nops —The total number of nops executed 
is given. These nops are due to load delay slots and branch 
delay slots. 

1974000 (0.042) multiply/divide interlock cycles (12/35 
cycles) — The IDT79R3000 has a separate integer multiply/ 
divide unit that takes 1 2 and 35 cycles for integer multiplication 
and division, respectively. Any attempt to read the result of a 
multiply/divide before the operation is complete will cause the 
CPU to interlock until the operation is finished. The number of 
cycles the units we re interlocked is given above. 

385600 (0.008) floating point data interlock cycles. 

9562 (0.000) floating point add unit interlock cycles. 

16676 (0.000) floating point multiply unit interlock cycles. 

372 (0.000) floating point divide unit interlock cycles. 

98541 (0.002) other floating point interlock cycles — The 
above lines give the interlock cycles of the IDT79R301 0 FPA. 

Data interlocks occur because of data dependencies 
betweenvarbusfloatingpointoperations. This usually happens 
because the source operand (register) of some fp operation is 
the destination register of some previous fp operation that has 
not completed. The interlock cycles dueto data dependencies 
between two successive floating point operations are 385600. 

Cycles inwhichthe add/mult/div are interlocked are because 
consecutive fp operations could not be issued. The add, 
multiply and divide units themselves are not pipelined and, 
therefore, a new operation cannot be started before the 
previous one completes. 

0.337 load nops per load — This number indicates that 67% 
of the load delay slots are filled with useful instructions. 

0.356 stores per memory reference — This number implies 
that 35% of the memory references are store operations and 
the remaining 65% are load operations. Many application 
programs typically have a 2:1 ratio for reads to writes. 

0.474 branch nops per branch — 53% of the branch delay 
slots have been successfully filled with useful instructions by 
the assembler. 
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Opcode Distribution: 


c.lt 

35951 

0.08% 

spec 

14871837 

31 .79% 

xor 

29882 

0.06% 

Iw 

6261479 

13.38% 

srlv 

23727 

0.05% 

addiu 

4970770 

10.63% 

xori 

23585 

0.05% 

sw 

4161532 

8.90% 

div 

22679 

0.05% 

Inop 

3119580 

6.67% 

sllv 

22030 

0.05% 

bnop 

3039486 

6.50% 

fcvtw 

19601 

0.04% 

addu 

2227361 

4.76% 

c.le 

19410 

0.04% 

andi 

1938082 

4.14% 

fsub 

17003 

0.04% 

beqz 

1881668 

4.02% 

addi 

14746 

0.03% 

bnez 

1648279 

3.52% 

bft 

14744 

0.03% 

sll 

1524644 

3.26% 

fdiv 

13146 

0.03% 

Ibu 

1315591 

2.81% 

mfhi 

12038 

0.03% 

li 

1255554 

2.68% 

fneg 

9690 

0.02% 

lui 

1244693 

2.66% 

c.eq 

8581 

0.02% 

b 

1030526 

2.20% 

divu 

7802 

0.02% 

sra 

866987 

1.85% 

fmov 

7283 

0.02% 

Ihu 

822784 

1.76% 

sub 

4142 

0.01% 

copl 

759725 

1.62% 

srav 

2869 

0.01% 

jr 

728865 

1.56% 

nor 

2389 

0.01% 

situ 

724597 

1.55% 

lb 

964 

0.00% 

Ih 

607812 

1.30% 

c.olt 

877 

0.00% 

jnop 

603324 

1 .29% 

syscall 

793 

0.00% 

bcond 

536865 

1.15% 

tabs 

736 

0.00% 

bne 

519056 

1.11% 

Iwl 

178 

0.00% 

bgez 

492881 

1.05% 

Iwr 

164 

0.00% 

sb 

445222 

0.95% 

swl 

73 

0.00% 

sh 

43886 

0.94% 

add 

73 

0.00% 

sltiu 

409184 

0.87% 

j 

36 

0.00% 

beq 

405527 

0.87% 




srl 

389980 

0.83% 

The lines 

above give the 

opcode distribution of the 

subu 

333942 

0.71% 

instructions that are executed. 

Notice that 759725 COP1 

jal 

332866 

0.71% 

instructions have been executed. This number and the 

sit 

298293 

0.64% 

percentage are cumulative for all the COP1 instructions. Less 

nop 

261743 

0.56% 

than 2% of 

the instructions 

executed are, therefore, 

Iwcl 

255500 

0.55% 

IDT79R3010 instructions. 


and 

225631 

0.48% 




blez 

202636 

0.43% 

24424 static instructions 


bgtz 

182090 

0.39% 

4414 static basic blocks 


fcvtd 

175925 

0.38% 

5.5 static instructions per basic block 

mflo 

129374 

0.28% 

Static opcode frequency 


slti 

127245 

0.27% 

spec 

6646 

27.21% 

fcvts 

118090 

0.25% 

Iw 

3408 

13.95% 

multu 

105875 

0.23% 

sw 

2438 

9.98% 

jalr 

103051 

0.22% 

addiu 

2157 

8.83% 

mtf 

65771 

0.14% 

addu 

1627 

6.66% 

swcl 

65635 

0.14% 

copl 

1160 

4.75% 

or 

60680 

0.13% 

Inop 

1094 

4.48% 

ori 

52753 

0.11% 

li 

1031 

4.22% 

bff 

50075 

0.11% 

jal 

917 

3.75% 

fmul 

48172 

0.10% 

bnop 

889 

3.64% 

bltz 

43984 

0.09% 

lui 

810 

3.32% 

cff 

40491 

0.09% 

beqz 

616 

2.52% 

ctf 

40046 

0.09% 

Iwcl 

526 

2.15% 

fadd 

38012 

0.08% 

bnez 

518 

2.12% 

mtf 

36121 

0.08% 

Ibu 

514 

2.10% 
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Ih 

500 

2.05% 

sll 

499 

2.04% 

jr 

474 

1.94% 

sh 

464 

1 .90% 

b 

455 

1.86% 

andi 

407 

1.67% 

jnop 

406 

1.66% 

sb 

378 

1 .55% 

sra 

324 

1.33% 

subu 

316 

1.29% 

fcvtd 

274 

1.12% 

swcl 

259 

1.06% 

bne 

256 

1.05% 

situ 

225 

0.92% 

nop 

202 

0.83% 

fcvts 

193 

0.79% 

beq 

189 

0.77% 

Ihu 

187 

0.77% 

sit 

74 

0.71% 

fmul 

117 

0.48% 

mtf 

117 

0.48% 

bcond 

105 

0.43% 

ori 

105 

0.43% 

blez 

104 

0.43% 

fadd 

103 

0.42% 

slti 

100 

0.41% 

srl 

88 

0.36% 

mflo 

80 

0.33% 

bgez 

78 

0.32% 

and 

75 

0.31% 

sltiu 

61 

0.25% 

multu 

57 

0.23% 

bgtz 

50 

0.20% 

mff 

49 

0.20% 

bff 

44 

0.18% 

fdiv 

42 

0.17% 

cff 

37 

0.15% 

ctf 

34 

0.14% 

c.lt 

32 

0.13% 

fsub 

29 

0.12% 

bltz 

27 

0.11% 

jalr 

22 

0.09% 

xori 

20 

0.08% 

div 

20 

0.08% 

xor 

19 

0.08% 

lb 

17 

0.07% 

fcvtw 

16 

0.07% 

c.le 

15 

0.06% 

bft 

15 

0.06% 

addi 

14 

0.06% 

fneg 

13 

0.05% 

fmov 

12 

0.05% 

sllv 

9 

0.04% 

syscall 

9 

0.04% 

or 

9 

0.04% 

c.eq 

9 

0.04% 


divu 

7 

0.03% 

mfhi 

6 

0.02% 

tabs 

6 

0.02% 

srlv 

5 

0.02% 

Iwl 

4 

0.02% 

Iwr 

4 

0.02% 

srav 

4 

0.02% 

sub 

3 

0.01% 

c.olt 

3 

0.01% 

j 

2 

0.01% 

swl 

2 

0.01% 

add 

2 

0.01% 

nor 

1 

0.00% 


The list above gives the number of different kinds of 
instructions the compiler has generated. The total number of 
instructions the compiler generated is 24424 and, in the 
executable, there are 1160 COP1 instructions. The numbers 
given in the list are cumulative of their kind. 

FPA EMULATION OVERHEAD 

The MIPS compilers generate executable code assuming 
the presence of an FPA. When the FPA is not present, or if it 
is not functional, for every FPA instruction, the exception 
handler invokes the floating point instruction emulation 
software. The current state execution information is passed 
onto the emulation software through the Process Control 
Block (pcb). Following the IEEE standards, the emulation 
software decodes the FPA instructions, fetches the operands, 
checks for NaNs, Infinities, Zeros and Denormalized numbers, 
carries out the FPA operation, normalizes the result and, 
finally, returns the result. 

The amount of time it takes to execute the emulation 
software depends on the instruction being emulated, the value 
of the operands and the amount of the emulation code in the 
cache. The following is a table listing of the emulation times 
of the emulation software of RISC/os 4.0 on M/120 with 
IDT79R3000 @ 16.67MHz for the floating point arithmetic 
instructions. The times are in micro seconds. The numbers 
in ( )s are the number of times the emulation is slower than 
actual execution on the IDT79R3010. 


Instruction 

Best Case 
usees 

Avg Case 
^secs 

Worst Case 
fxsecs 

ADD.S 

8.35(70) 

18.31(153) 

25.5(213) 

ADD.D 

8.35(70) 

18.31(153 

25.5(213) 

SUB.S 

8.35(70) 

18.31(153) 

25.5(213) 

SUB.D 

8.35(70) 

18.31(153) 

25.5(213) 

MUL.S 

9.48(40) 

23.08(96) 

42.91(179) 

MUL.D 

9.48(32) 

23.08(77) 

42.91(143) 

DIV.S 

9.48(13) 

23.08(32) 

42.91(60) 

DIV.D 

9.48(9) 

23.08(20) 

42.91(38) 
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The best case occurs when most of the emulation software 
is in the cache and the operands are either zero or infinity. On 
the other hand, the worst case arises when very little of the 
emulation code is in the cache and the operands are 
denormalized. The average times are good indicators of the 
overhead for regular data. To determine the emulation 
overhead for an application program, the output of the Pixstats 
must be used. The static and the dynamic opcode distribution 
for coprocessor 1 listed in the output and the times given 
above helpthe designer estimate the total emulation overhead. 


CONCLUSION 

The software tools — cache2000, Pixie and Pixstats — 
allowtheuserto accurately project the performance of different 
IDT79R3000-based systems. The application program needs 
to be compiled only once. cache2000 executable must be 
created for every proposed memory model. All these models 
can be run in parallel as background jobs. To find the interlock 
cycles of the IDT79R3000 and IDT79R3010, Pixstats should 
be run only once. The cache2000output clearly points out the 
tunable parameters of the memory subsystem. The Pixstats 
output provides information on integer and floating point 
interlocks and static/dynamic opcode distribution. 
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APPENDIX A 


cache2000{\spp) 

SYSTEM PROGRAMMER’S MANUAL 
Name 

cache2000 — analyze cache misses for an M2000 system 
(IDT79R2900/R2950) 

Synopsis 

Cache2000 [options] 

Description 

cache2000 analyzes a memory reference stream produced 
by a program pixified with -idtrace. To use cache2000, first 
use Pixie(1 ) to translate and instrument the executable object 
module for the program. Use pixie’s -idtrace option. Next, 
execute the translation on an appropriate input using 
makepipe(l). The pixified program will output the addresses 
and types of memory references to Unix file descriptor 19. 
With makepipe this trace can be fed into the standard input of 
cache2000 for simulation. Example: 

makepipe 1 9 foo. pixie fooarg ‘<‘ fooinput V foooutput 
TON 

/usr/local/bin/cache2000 -comment ‘loo fooarg” 
V foo.cache2000 

cache2000 differs from cache26(1) in that it models the 
IDT79R2950 memory board, which supports page mode 
writes and 16-word cache refill. The IDT79R2900 also uses 
the IDT79R3000 2-cycle partial word store option to avoid 
cache invalidation. The cache sizes are hardwired in the 
source to 64K bytes. The data cache is write-thru, with 
IDT79R2020 write buffering. cac/je2000simulatesthememory 
subsystem. Accurate performance predictions must add the 
stall cycles predicted by pixstats(l). 

-comment string 

Include string in the output. This is useful for 
associating the output with the program and input 
used to generate it. 

-print N 

Set the interval for periodic statistics printout. N is the 
interval in millions of instructions. Default is 20 million 
instructions. 

-flush N 

Set the cache flush interval. N is the interval in 
millions of instructions. Default is 1 million instructions. 

-[no]random_flush 

Flush cache at random intervals (Poisson distribution). 
Default: -norandomjlush. 


-cycle ns 

Use a ns cycle time when converting cycle counts to 
seconds. Default is 40ns. 

-clock MHz 

Use a 1000/MHz cycle time when converting cycle 
counts to seconds. Default is 25.0MHz. 

-wbsize N 

Simulate a N-deep IDT79R2020 write buffer. Default 
is 4 deep. Maximum is 8 deep. 

-readjatency N 

Instruction and data cache misses take N+16 extra 
cycles. Default is 12 cycles. 

-idle_word N 

Set the IDTR2950 memory board idle word write time 
to N cycles. Default is 4 cycles. 

-page_write N 

Set the IDTR2950 non-idle, page mode write time to 
N cycles. Default is 2 cycles. 

-nonpage_write N 

Set the IDTR2950 non-idle, non-page mode write 
time to N cycles. Default is 6 cycles. 

See Also 

pixie(1 ), pixstats(1 ), makepipe(1 ), cache23(1 ), cache26(1 ), 

The MIPS System Programmer’s Reference. 

These are the cache2000 parameters that can be modified 
to model different memory systems. The values assigned to 
these parameters are just examples. 

I* cahce parameters 7 

I* instruction refill size, in words 7 
#ifndef Lrefilljog 

# define i_refill_log 4 

#endif 

#define i_refill_size (1«i_refill_log) 

I* instruction cache size, in words 7 
#ifndef i_size_log 

# define i_size Jog 14 

#endif 

#define i_refill_size (1«i_refill_log) 

I* instruction cache size, in words 7 
#ifndef i_size_log 

# define i_size Jog 14 

#endif 

#define i_size (1«i_sizeJog) 
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I* data refill size, in words 7 
#ifndef d_refill_log 

# define d_refill_log 4 
#endif 

#define d_refill_size (1«d_refillJog) 

I* data cache size, in words 7 
#ifndef d_size_iog 

# define d_size_log 14 
#endif 

#define d_size (1«d_size_log) 

I* byte gathering 7 
#ifndef byte_gathering 

# define byte_gathering 0 
#endif 

I* read conflict checking in write buffer 7 
#ifndef read_conflict_check 

# define read_conflict_check 0 

#endif 


/* instruction streaming 7 
#ifndef istreaming 
# define istreaming 1 
#endif 

/* memory parameters 7 
private unsigned wbsize = 4; 
private unsigned readjatency = 13; 
private unsigned idle_write_time = 4; 
private unsigned page_write_time = 2; 
private unsigned nonpage_write_time = 6; 
private unsigned byte_extra_write_time = 4; 

private char ‘comment = NULL; 
private boolean random_flush = false; 
private unsigned printjnterval = 20000000; 
private unsigned flushjnterval = 1000000; 
private double random_flush__parameter; 
private double cycletime = 40e-9; 
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PIXIE(l-SysV) 

RISC/OS REFERENCE MANUAL 
Name 

pixie — add profiling code to a program 

Synopsis 

pixie in_prog_name [options] 

Description 

Pixie reads an executable program, partitions it into basic 
blocks, and writes an equivalent program containing additional 
code that counts the execution of each basic block. {A basic 
block is a region of the program that can be entered only at the 
beginning and exited only at the end). Pixie also generates a 
file containing the address of each of the basic blocks. 

When you run the pixie-generated program, it will (provided 
it terminates normally or via a call to exit(2)) generate a file 
containing the basic block counts. The name of the file is that 
of the original program with any leading directory names 
removed and “.Counts” appended. prof(1) and plxstats(l) 
can analyze these files and produce a listing of profiling data. 

-[no]quiet 

[Permits] or suppresses messages summarizing the 
binary-to-binary translation process. 

Default: -noquiet. 

-{no]branchcounts 

-branchcounts inserts extra countersto track whether 
each branch instruction is taken or not taken. When 
this option is used, plxstats will automatically print 
more statistics. Default: -nobranchcounts. 

-[no]idtrace 

[Disable] or enable tracing of instruction and data 
memory references, -idtrace is equivalent to using 
both -itrace and -dtrace together. Default: -noidtrace. 

-[no]itrace 

[Disable] or enable tracing of instruction memory 
references. Default: -noitrace. 


-[no]dtrace 

[Disable] or enable tracing of data memory references. 
For the moment, -dtrace requires -itrace. Default: 
-nodtrace. 

-idtracejile number 

Specify a UNIX file descriptor number for the trace 
output file. Default: 19. 

-bbaddrs name 

Specify a name for the file of basic block addresses. 
Default is to remove any leading directory names 
from the in_prog_name and append “.Addrs”. 

-bbcounts name 

Specifies the full filename of the basic block counts 
file. Default: objfile.Counts. 

-mipsl 

Use the MIPS1 instruction set (IDT79R2000, 
IDT79R3000) for output executable. This is the 
default. 

-mips2 

Use the MIPS2 instruction set (a superset of MIPS1) 
for output executable. 

See Also 

prof(1), pixstats(l). 

The MIPS Languages Programmer’s Guide. 

Bugs 

The handler function address to the signal system calls is 
not translated, and so programs that receive signals will not 
work pixified. 

Programs that call vfork() will not work pixified because te 
child process will modify the parent state required for pixie 
operation. Use fork() instead. 

Pixified code is substantially larger than the original code. 
Conditional branches that used to fit in the 16-bit branch 
displacement field may no longer fit, generating a pixie error. 
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PIXST ATS(1 -SysV) 

RISC/OS REFERENCE MANUAL 

Name 

pixstats — analyze program execution 

Synopsis 

pixstats program [options] 

Description 

Pixstats analyzes a program’s execution characteristics. 
To use pixstats, first use PIxie(1 ) to translate and instrument 
the executable object module forthe program. Next, execute 
the translation on an appropriate input. This produces a 
.Count file. Finally, use pixstats to generate a detailed report 
on opcode frequencies, interlocks, a mini-profile, and more. 

-cycle ns 

Assume a ns cycle time when converting cycle counts 
to seconds. 


-r201 0 

Use r2010 floating point chip operation times and 
overlap rules. This is the default. 

-r2360 

Use r2360 floating point board operation times and 
overlap rules. 

-disassemble 

Disassemble and show the analyzed object code. 
See Also 

pixie(l), prof(1), The MIPS Languages Programmer’s 
Guide. 

Bugs 

Pixstats models execution assuming a perfect memory 
system. Cache misses etc. will increase above the pixstats 
predictions. 
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By Danh Le Ngoc, Ignacio Osorio and Avigdor Wlllenz 

INTRODUCTION • Read-Write buffers, explains what read and write buffers 

The objective of this application note is to describe the use are < an d how they function in a R3000/1 system, 
of the IDT 73200 multilevel pipeline register as the write buffer * Implementing R-W Buffers, describes howto implement 
and read bufferforthe R3000/1 RISC processor.The following the IDT73200 as read and write buffers. Buffer depths are 
topics are discussed: also discussed in this section. 

• The IDT73200 Multilevel Pipeline Register, presents a * A Typical System, provides an example of read-write 
brief description of general characteristics and configura buffers using the IDT73200, within a RISC system. It also 
tions of the multilevel pipeline register. presents the control logic and PAL equations to operate the 

IDT73200 as read and write buffers. 


DO-15 




YO-15 5E 

Figure 1. Block Diagram of the IDT73200 


USING THE IDT73200 MULTILEVEL 
PIPELINE REGISTERS AS READ AND 
WRITE BUFFERS WITH R3000/1 


APPLICATION 

NOTE 

AN-65A 



©1990 Integrated Device Technology, Inc. 


9.10 


2 


6/90 










USING THE IDT73200 OR IDT73210 AS 
READ AND WRITE BUFFERS WITH R3000 


APPLICATION NOTE AN-65 


13 

12 

11 

10 

MNEMONIC 

FUNCTION 

PIPELINE 

LEVEL 

0 

0 

0 

0 

LDA 

D0-15->A 

1 

0 

0 

0 

1 

LDB 

D0-15->B 

1 

0 

0 

1 

0 

LDC 

D0-1 5->C 

1 

0 

0 

1 

1 

LDD 

D0-15->D 

1 

0 

1 

0 

0 

LDE 

D0-15->E 

1 

0 

1 

0 

1 

LDF 

D0-15->F 

1 

0 

1 

1 

0 

LDG 

D0-15->G 

1 

0 

1 

1 

1 

LDH 

D0-15->H 

1 

1 

0 

0 

0 

LSHAH 

D0-1 5->A->B->C->D->E->F->G->H 

8 

1 

0 

0 

1 

LSHAD 

D0-1 5->A->B->C->D 

4 

1 

0 

1 

0 

LSHEH 

D0-1 5->E->F->G->H 

4 

1 

0 

1 

1 

LSHAB 

D0-1 5->A->B 

2 

1 

1 

0 

0 

LSHCD 

D0-1 5->C->D 

2 

1 

1 

0 

1 

LSHEF 

D0-15->E->F 

2 

1 

1 

1 

0 

LSHGH 

D0-15->G->H 

2 

1 

1 

1 

1 

HOLD 

HOLD ALL REGISTERS 



THE IDT 73200 MULTILEVEL PIPELINE 
REGISTER 

The IDT 73200 is a high-speed, low-power Programmable 
Multilevel Pipeline Register. It has a dedicated 16-bit input 
port and a dedicated 1 6-bit output port. 

As shown in figure 1, the IDT73200 contains eight 16-bit 
registers which can be configured as one 8-level, two 4-level, 
four 2-level, or eight 1 -level pipeline registers. Data at the input 
port DO-1 5 can be written into any of the eight registers under 
control of the load control: 10-3. Figure 2 illustrates the load 
control for the input port . 

An eight-to-one output multiplexer allows data to be read 
on the Y-bus from any of the eight registers using the output- 
selection control: SO-2. Figure 3 illustrates the output control. 

READ-WRITE BUFFERS 

As shown in the Figure 4 , a high-speed computer system 
consists of a R3000/1 chip set, high-speed cache, write buffer, 
read buffer , I/O devices, and main memory. Since the main 


processor supports a write-through cache policy, all data 
written into the data cache must also be written into the main 
memory to maintain the cache coherency. Due to the 
data-rate mismatch between the high-speed processor bus 
(33MHz -> 240Mbytes/sec) and slow speed main memory 
(10-15MHz ->10-40 Mbytes/sec), a write buffer and a read- 
buffer are required. The write buffer is an elastic buffer which 
is used to capture addresses and data at the cache speed. At 
the other side of the write buffer, data is transfered into the 
main memory at the system memory speed. 

When a load operation causes a cache miss, a main 
memory read is initiated. Two types of main memory read are 
supported on the R3000/1 : single word transfer and multiple 
word transfer. In either case, a read-buffer is used to capture 
data from the system memory at memory speed. Then data is 
written into the cache at the cache speed. The depth of the 
write buffer and the read-buffer are dependent on different 
factors such as processor speed, system memory speed, bus 
protocol and the application. 
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Figure 4. Simplified Block Diagram of a High-Speed RISC System 


IMPLEMENTING R-W BUFFERS 

As previously described in section 1 , the IDT 73200 is like 
a high-speed synchronous memory with a depth that is 
programmable from 1 to 8 deep . Therefore, a write buffer and 
read buffer for the high performance R3000/1 system can be 
easily implemented with the help of the IDT 73200. 

Figure 5 illustrates adetailed R3000/1 system which consists 
of the R3000/1 chip set, write buffer, read buffer, IDT49C465 
Flow-thruEDC™, system main memory, and several state 
machines to control the main memory interface. In this scheme, 
the data bits together with the parity bits flow from the main 
memory through the EDC device for error detection and 
correction. When an error is detected, the EDC informs the 
read buffer control through an error feedback path. 


In figure 5 the write buffer consists of two paths: address 
and data. The address path (34-bit) is created with three IDT 
73200s to capture the address, tag , and the access type bits. 
The data path of the write buffer (32-bit), is formed by two 
IDT73200. Data coming from the CPU is buffered into the 
“data path" write buffer prior to being written into the main 
memory. 

The read buffer in figure 5 consists only of a “data path” 
(36-bits) which includes the required data parity on the 
R3000/1 system. The IDT 49C465 high speed Flow-thruEDC 
can be used to maintain the data integrity of the system main 
memory. Also, parity bits are generated with the help of the 



IDT49C465 Flow-thruEDC. 
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Figure 5. Detailed R3000 System with Read and Write Buffer 
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BUFFER DEPTH 

As discussed earlier, the IDT 73200 can be configured for 
different depths: eight 1 -level, four 2-level, two 4-level or one 
8-level deep registers. This feature makes the 73200 particu- 
larly flexible in Read/Write buffer applications. 

The depth of the write buffer is programmable using the 
load-control and the output selection. A single IDT 73200 can 
be programmed to a buffer depth of 1 to 8. A deeper write 
buffer can be implemented by cascading several devices in 
depth as shown in the figure 6. The right depth depends on the 
application program and/or hardware requirements. 
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Figure 6. 16-Deep and 32-Wide Write Buffer Using IDT73200s 


With the help of the write buffer, the CPU can write to the 
memory without regard to the memory speed. However, if 
consecutive (or back to back) write operations take place, the 
write buffer would eventually become full and cause the CPU 
to stall. Naturally, this could present a problem in high perform- 
ance systems. A logical solution to CPU stalls is to increase 
the depth of the write buffer. 


Typical write buffer depths are two and four levels. How- 
ever, high end applications with intensive memory access 
may require deeper write buffers, i.e., eight or sixteen levels 
as shown in figure 6. 

Read buffer depth design issues are somewhat different 
from those of the write buffers. For example, when an l-cache 
miss occurs we are faced with the question of “how many 
blocks to refill in the l-cache? “To answer this question, we 
should recall that the miss rate is determined by the cache size 
and the block size. Therefore, the block size determines the 
size of the read buffer. Thus to bring 16 words from memory 
into the l-cache would require a 16 level deep read buffer. 

Fetching small blocks of instructions using a shallow read 
buffer implies constantly fetching instructions and therefore 
stalling the CPU for several cycles. Depending on the applica- 
tion, this could impose significant penalty on system perform- 
ance. Due to program locality (sequentiality of instructions), 
we would benefit most by fetching a large block. Deep read 
buffers for l-cache are therefore an appealing solution. Typi- 
cal read buffer depths are 4 levels; high end applications could 
consider from 8 to 16 levels read buffers. 

D-cache fetching, on the other hand, is random in nature 
and typical schemes prefer a 1 -level deep read buffer. 

The flexibility of the 73200 allows instantaneous 
re-configuration when fetching for the l-cache and then for the 
D-cache. For example, we could have an R3000/1 initialized 
for 1 6 words l-cache fetching and 1 word D-cache fetching and 
still use the same read buffer. This can be accomplished 
through a read buffer controller capable of configuring the 
73200 to different depths. 

Lets now discuss two popular read buffer configurations. 

a) 1 -level deep Read Buffer & 

b) 8-level deep Read Buffer 


1-LEVEL DEEP READ BUFFER 

One-level deep read buffers can be used in high 
performance systems where the data transfer between the 
main memory and the CPU is efficiently handled. This can be 
accomplished through a sophisticated memory scheme, like 
interleaving, and supported by a fast DRAM architecture. 
Such a scheme minimizes the transfer rate mismatch 
between the CPU and the main memory. One-level deep read 
buffers can also be applied in low performance systems where 
the penalty in fetching one word at a time is not significant. 


8-LEVEL DEEP READ BUFFER 


This configuration can be used in a general purpose 
system. An 8-level deep write buffer offers the benefit of 
effective data rate capture from the R3000/1 to the main 
memory. The 8-level Read Buffer is convenient for slow main 
memory systems. 
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Figure 7. R3000/1 System with the Read-Write Buffers 


A TYPICAL SYSTEM 

Figure 7 shows the interconnections among the R3000/1 , 
Instruction and Data Cache, Read/Write Buffers, control state 
machine, and the system memory. The read and write buffers 
are built from multilevel pipeline registers denoted by the IDT 
73200. The control state machine represents the logic needed 
to drive the read and write buffers. 

WRITE BUFFER INTERFACE 

A write buffer, as discussed earlier, transfers data from 
cache to main memory and provides address bits to select 
memory locations. This is illustrated in Figure 8: one write 
buffer isdedicated to pass address bits and the othertransfers 
data to the main memory. Therefore, the write buffer labeled 
“address” is activated in both memory reading and memory 
writing operations. 

As seen in Figure 8, the address path carries address, 
tag and Acc type bits. Notice that the write buffer labeled 
“Address”, is formed by two IDT 73200. The first 73200 
captures Address low 0-13 and AccTyp 0,1. The second 
73200 captures Adr High 1 4-29 and Tag 0,1 . 

The Data path, as shown in figure 8, carries data and parity 
bits. The data write buffer uses two IDT 73200. They latch 32 
data bits from the cache and transfer them to a memory 
location selected by a memory controller. Notice that parity 
bits can be generated using the IDT49C465 when data is 
flowing from the write buffer to the system memory. 


A situation of interest in deep write buffers is the following: 
The CPU requests reading data from a memory location that 
is about to be updated by the write buffer. The potential 
problem is clear: reading data that hasn’t been updated yet. 
To avoid this problem, write buffer systems use conflict 
checking schemes. 

A common “conflict checking scheme” is implemented by 
comparing addresses of memory locations to be read and 
written by the read/write buffers. When an address match is 
found, a match signal is send to the CPU. This solution may 
involve using more hardware to implement such scheme. 
Another approach is ‘Hushing”. To simplify the design, the 
write buffer is ‘Hushed”, i.e. , all pending writings are placed in 
the main memory before any read buffer operation takes 
place. Such is the case in figure 8 where no additional 
hardware was needed. 

Figure 8 shows the associated control circuitry to drive the 
write buffer. Notic e that writ e buffer “data” and AdrHi (14-31) 
are clocked at the SYSOUT signal, wh ereas AccTyp (0:1 ) and 
adr-Lo(0:13) are clocked at SYSOUT. 

WRITE BUFFER CONTROLLER 

The write buffer controller is internally driven by two count- 
ers: The l-counter selects the load operation for the input to 
the 73200 . The SEL-counter selects the register to be read in 
the 73200 output. The write buffer controller also takes care of 
the ‘Hushing” scheme. 
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Figure 8. Write Buffer Interface 


The PAL equations for the write buffer controller are: 
MODULE WB_CONT ; 


TITLE WB CONT; 


Table; 



TYPEMMI 16R4; 



X NOT = 

ICE3 XOR SELC3; 

Inputs: 



LWR NOT:= 

LRD AND WBEMPTY AND WRACQ 

IC3 

Node[pin2]; 



AND RESETOR LRD AND! MEMWR 

SELC3 

Node[pin3]; 



AND WRACQ AND! WBEMPTY AND 

WB EMPTY 

Node[pin4]; 



RESET; 

WRACQ 

Node[pin4j; 




MEMRD 

Node[pin5]; 


LRD NOT := 

(IWBEMPTY AND IMEMRD AND 

WBFULL 

Node[pin6]; 



EWR AND RESET) OR (!LRD AND 

MEMWR 

Node[pin7]; 



IMEMRD AND RESET); 

RESET 

Node[pin9]: 







WBCEN NOT = 

WBFULL; 

LRD 

Node[pin15]; 




LWR 

Node[pin14]; 


ICE NOT = 

(IMEMWR AND WBFULL) 





OR( IMEMRD AND IWBEMPTY); 

Outputs: 





LRD 

Node[pin15]; 


SELCE NOT- 

(!LWR AND IWRACQ) OR 

LWR 

Node[pin14]; 



(!LRD AND MEMRD); 

X 

Node[pin19]; 

END; 



WBCEN 

Node[pin12]; 

END WB_CONT. 


ICE 

Node[pin13]; 




SELCE 

Node[pin18]; 
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The PAL equations for the 

-counter are: 

SELC3 

SELC2 

MODULE l-COUNTER; 


SELC1 

TITLE 1 COUNTER; 
TYPEMMI 16R4; 


SELCO 

SO 

Inputs; 


SI 

X 

Node[pin2]; 

S2 

1C 

Node[pin3); 

Outputs; 

ICE 

Node[pin4]; 

SO 

R3KRST 

Node[pin5j; 

SI 

IC3 

Node[pin17] 

SELC3 

IC2 

Node[pin16] 

SELC2 

IC1 

Node[pin15] 

SELC1 

ICO 

Node[pin14] 

CO 

Outputs; 


S2 

IC3 

Node[pin17] 

C 

IC2 

Node[pin16] 

Table; 

IC1 

Node[pin15] 

SELCO NOT: 

ICO 

Node[pin14] 


WBFULL 

Node[pin19] 


WB EMPTY 

Node[pin12] 


Table; 


SELC1 NOT: 

ICO NOT := 

(ICO AND MCE) OR (IICO 


AND ICE) OR IR3KRST; 


Node[pin17] 
Node[pin1 6] 
Node[pin15] 
Node|pin14] 

Node[pin18] 

Node[pin13] 

Node[pin19] 

Node[pin18] 

Node[pin13] 

Node[pin17] 

Node[pin16] 

Node[pin15] 

Node[pin14] 

Node[pin19] 

Node[pin12] 


(SELCO AND ISELCE) OR 
(ISELCO AND SELCE) OR 
IR3KRST; 

(ISELCO AND ISELC1 AND 
ISELCE) OR (SELCO AND 
SELC1 AND ISELCE) OR 
IR3KRST; 


IC1 NOT:= (IICO AND MCI AND IIC1AND 



MCE) OR (ICO AND IC1 AND 

MCE) OR (MCI AND ICE) OR 
IR3KRST; 

SELC2 NOT:= 

(ISELCO AND ISELC1 AND 
ISELC2 AND ISELCE) OR 
(SELCO AND SELC1 AND 
SELC2 AND ISELCE) OR 

IC2 NOT:= 

(IICO AND MCI ABD IIC2 

ABD MCE) OR (ICO AND IC1 

AND IC2 AND MCE) OR 
(IICO AND IC1 AND IIC2 AND 

MCE) OR (ICO AND MCI 

AND IIC2 AND MCE) OR 
(IIC2 AND ICE) OR 


(ISELCO AND SELC1 AND 
ISELC2 AND ISELCE) OR 
(SELCO AND ISELC1 AND 
ISELC2 AND ISELCE) OR 
(ISELC2 AND SELCE) OR 
IR3KRST; 


IR3KRST; 

SELC3 NOT:= 

(ISELC2 AND ISELC3 AND 
ISELCE) OR (ISELC1 AND 

IC3 NOT := 

WBFULL NOT = 

(IIC2 AND MC3 AND MCE) 

OR (MCI AND IC2 AND IIC3 

AND MCE) OR (IICO AND 

IC1 AND IC2 AND IIC3 AND 

MCE) OR (ICO AND IC1 AND 

IC2 AND IC3 AND MCE) OR 
(MC3 AND ICE) OR 

IR3KRST; 

IIC AND IX; 


SELC2 AND ISELC3 AND 
ISELCE) OR (ISELCO AND 
SELC1 AND SELC2 AND 
ISELC3 AND ISELCE) OR 
(SELCO AND SELC1 AND 
SELC2 AND SELC3 AND 
ISELCE) OR (ISELC3 AND 
SELCE) OR IR3KRST; 

WEMPTY NOT= 

IIC AND X; 

SO NOT = 

(ICO XOR SELCO); 

End; 


SI NOT = 

(IC1 XOR SELC1); 

End l_Counter; 


S2 NOT = 

(IC2 XOR SELC2); 


The PAL equations for the SEL-counter are: 

MODULE SEL_COUNTER; 

TITLE SEL_COUNTER; 

TYPEMMI 1 6R4; 

Inputs; 


SELCE 

Node[pin2] 

ICO 

Node[pin3] 

IC1 

Node[pin4] 

IC2 

Node[pin5] 

R3KRST 

Node[pin6] 


C NOT = SO AND SI AND S2; 
End; 

End SEL_Counter; 
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DATA PATH 



MEMORY CONTROL PATH DATA PATH 


Figure 9. Read Buffer Interface 


READ BUFFER INTERFACE 

When reading from the main memory to the cache, the 
R3000/1 sends a memory read signal to the control state 
machine, represented in Figure 9 as the Read Buffer Control 
Logic. Once the signal has been acknowledged, the R3000/1 
places the address, tag, and data size in the write buffers. 
Internally, the 73200 registers capture this information at 
R3000/1 clock rate with load and output configurations deter- 
mined by the read buffer controller. Once the address is 
available in the address bus, the controller will then drive 
memory signals to initiate the memory transfer at memory 
clock rate into the read buffer. 


Outputs; 

LRD 

LWR __ 

WB_CLK_DIS_ 

WB_DATA_OE 

CMEMRD 

CCMEMRD 


WRBUSY 


Table; 


LWR NOT := 


LRD NOT := 


Node[pin15]; 

Node[pin14]; 

Node[pin19]; 

Node[pin12]; 

Node[pin13]; 

Node[pin18]; 

Node[pin16]; 

URD AND IMEMWR AND 
RESET OR !LWR AND 
WRACQ; 

LWR AND IMEMRD; 


READ BUFFER CONTROLLER 

The read buffer controller monitors the flow of data within 
the Read Buffer by programming the 73200 internal registers 
to the appropriate load mode and memory clock frequency. 
Finally, the controller selects the output registers at such 
speed to match the R3000/1 frequency. 

The PAL equations for the read buffer controller are: 


MODULE W/RB CONT; 


TITLE W/RB CONT; 


TYPEMMI 16R8; 


Inputs; 


WRACQ 

Node[pin5]; 

MEMRD 

Node[pin6]; 

MEMWR 

Node[pin8]; 

RESET 

Node[pin9]; 

LRD 

Node[pin15] 

LWR 

Node[pin14] 

WB CLK DIS 

Node[pin19] 

CMEMRD 

Node[pin13] 

CCMEMRD 

Node[pin18] 

WRBUSY 

Node[pin16] 


WRBUSY NOT := IMEMWR OR 

IWRBUSY AND WRACQ 
OR IMEMRD; 

WBCEN NOT := IMEMWR OR !WB_CLK_DIS 

AND WRACQ OR IMEMRD 
AND CCMEMRD; 

WB_DATA_OE NOT:= LRD AND IMEMWR OR 
{need to be inverted} 

ILWR AND WRACQ; 

CMEMRD NOT := IMEMRD; 

CCMEMRD NOT:= CMEMRD; 

End; 

End W/RB_CONT; 

CONCLUSION 

As the speed of the processor increases, write- and read 
buffers must also become faster and deeper. The high-speed 
multi-level pipeline register IDT 73200 meets that challenge 
by providing a fast and flexible data path to suit various high- 
speed RISC and CISC processors. 
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By V. S. Ramaprasad 


INTRODUCTION 

In this application note, the design of one deep read and 
one deep write buffer to be used in an R3000 system is 
described with boolean equations and timing diagrams. The 
boolean equations are for the control signals of the read and 
write buffers and the main memory interface. This control 
logic can be implemented with any PLD. The syntax chosen 
to describe these equations is simple and it is not associated 
with any PLD programming software. The timing diagrams 
explain the various states during the operation of one deep 
read and write buffers. Also described in this application note 
are the other possible configurations of implementing read 
and write buffers with IDT7321 Os. These components can be 
used as two deep read and one deep write , and one deep read 
and two deep write buffers. Beforethe application is presented, 
the features of 73210 are described and a summary of the 
memory interface signals of R3000 is given. 

R3000 based systems require read/write buffers between 
the CPU and the main memory due to memory bandwidth 
mismatch. The main memory system supplies the instructions/ 
data through a read buffer. The CPU makes the data updates 
to the main memory through a write buffer. The speed 
differences between the CPU, the caches and the main 
memory that typically exist in many systems demand the use 
of at least one level deep read and write buffers. The use of 
these buffers isolates the caches from the rest of the memory 
system. They also limit the physical length of the address and 
data lines and serve as drivers to the rest of the system. 

The gain in performance by increasing the depth of the read 
andthe write buffers is completely dependent onthe application 
program being executed. By modeling memory subsystems 
with different depths of read/write buffers (using the System 
Programmer’s Package tools forthe R3000) and running the 
application program on the model, the designer can make the 
trade-off between the cost and the depth of the buffers. For 
high performance systems with sophisticated main memory 
schemes like interleaving, and for systems with fast DRAM 
architectures like Page Mode, or Static Column Mode, a one 
deep read buffer might satisfy the transfer rate of the processor. 


For low performance systems, where the penalty of fetching 
one word at a time is not significant, and for applications with 
infrequent successive writes, a one deep write buffer might 
also deliver optimal performance. 

In systems where one-level deep read and write buffers 
proved to be sufficient, a bidirectional register can be utilized 
to serve as both read and write buffers. The 8-bit bidirectional 
register, IDT7321 0, with parity checking and parity generation 
is an ideal candidate for this purpose. This bidirectional 
register also allows the designerto build a two-level deep read 
buffer and one level deep write buffer, or one- level read buffer 
and two-level write buffer configurations. Using IDT73210 
reduces the parts that are needed for parity generation. Also, 
by clocking in the lower address bits and the higher address 
bits with separate clocks, the designer can eliminate latching 
the address low bits. 

IDT73210 FEATURES 

Figure 1 shows the features of IDT7321 0 with all the control 
signals and data paths. It is a bidirectional buffer with separate 
output enables and clock enables. Data is registered with the 
same clock in both directions. There is a single data path from 
port A to port B. The 8-bit data and the parity bit are clocked 
through register X. The POLARITY signal is used to select 
even or odd parity generation. Even parity checking is done 
on the data, and a parity error is indicated by PERRA. The 8- 
bit data and the parity bit are enabled through a tri-stateable 
buffer to port B. 

There are two data paths from port B to port A. A multiplexer 
controlled by SEL selects a path. Even parity checking is done 
in both the paths and parity error is indicated by PERRB. The 
first path is through latch W and register Z. In this path bit W8 
is complemented by POLARITY to yield either even or odd 
parity.The second path is through registers Y & Z and even 
parity is generated on the data. The two registers in the 
second data path provide the user with two- level deep 
buffering. The 9-bit output is enabled through a tri-stateable 
buffer to port A. 


6/90 


©1990 Integrated Device Technology, Inc. 


9.10 


ii 





USING THE IDT73200 OR IDT73210 AS 
READ AND WRITE BUFFERS WITH R3000 


APPLICATION NOTE AN-65 


Ao-fl 


AOE 


PERRB 


VCC GNDo-2 


AEN 

CP 


POLARITY 



BEN 

LE 

SEL 


PERRA BOE 


Bo-s 

Figure 1. IDT73210 
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AdrLo (0:17) <: 
AcTyp2 •*- 
AcTypI •*— 
AcTypO ■*- 
BusErr — 
CpCondO — 
SysOut ■*- 



Figure 2. Memory Interface Signals 


C> Tag (16:31) 
£> Data (0:31) 
— ► MemRd 

— RdBusy 
— ► MemWr 

— WrBusy 
-► XEn 

2647 drw 02 


IDT79R3000 MEMORY INTERFACE 

The R3000 has interfaces to the main memory through the 
asynchronous memory bus. The output signals indicate the 
nature of operation that the R3000 is performing. The input 
signals are used to indicate the termination of a stall, block 
refills, and to cause exception processing. 

The figure above shows the signals used to interface to 
main memory. The address bus is split into AdrHi and AdrLo. 
The AdrHi bus is also used as the Tag bus for cache reads and 
therefore is shown as bidirectional. 


MemRd: This signal indicates the entry into the stall on 
a read operation. It is an active-low output signal. This 
output signal of the R3000 is used by the state machines 
to enter a read state and signal the memory system that 
the R3000 accepts data from the supplied 32-bit address. 
For one word refill, MemRd is deasserted by the R3000 
one cycle afterthe RdBusy signal is deasserted indicating 
that the required data is ready. The deassert ion of 
MemRd signals the end of a read stall. MemRd stays 
asserted during the entire stall cycles. 

RdBusy: This input signal to the R3000 is used to enter 
and terminate read stall cycles. The deassertion of 
RdBusy terminates stall cycles and the R3000 enters a 
fixup one cycle later during single word loads or it enters 
refill cycles in case of multiple word loads. RdBusy 
assertion and deassertion is sampled by the R3000 in 
phase 1 of the clock cycle. 

XEn: This active-low, output signal is used to enable the 
output of the read buffer in refill and fixup cycles. 


MemWr: This out put sign al is asserted low for store 
operations. Unlike MemRd, this signal is active for only 
one cyc le as are the associated data and addresses. 
MemWr is used to enter a write state. 


WrBusy: In order to create a write stall, this input signal 
to the R3000 h as to be asserted low during the cycle in 
which MemWr is asserted. The deassertion of WrBusy 
terminates a write stall and the R3000 enters the fixup 
cycle. In the fix up cycle, the last write opera tion duri ng 
which WrBusy was asserted is repeated. WrBusy is 
usua lly tied to the signal that indicates the write buffer is 
full. WrBusy assertion is sampled by the processor in 
phase 2 and the deassertion is sampled in phase 1 of the 
clock cycle. 


SysOut: This is the clock output of the 79R3000 and is 
the clock frequency at which the R3000 is rated. 

CpCondO: The condition of this input signal to the 
R3000 in stall cycles determines if the processor will do 
a single word read or a multiple word read. 


BusErr: This input signal is provided as a mechanism 
to create an exception in the R3000 and as an aid to 
escape from interminable stall cycles. 

AccTypO: This output signal has three functions. During 
cached reads it indicates whether there was a data 
cache miss or an instruction cache miss. This information 
is useful if the block refill size is different for data and 
instructions. During uncached reads it is used with 
AccTypI to indicate the size of the data being read. 
During writes it is used along with the AccTypI to 
indicate the size of the data being written. 

AccTypI : This output signal is undefined for cached 
reads. For uncached read operations and for store 
operations, AccTypI along with AccTypO, indicates the 
size of data transfer. 

AccTyp2: AccTyp2 is undefined for store operations 
with stall cycles. Forloadoperations.it is highforcached 
operations and low for uncached operations. During run 
cycles, this line indicates whether there is any data 
transfer during the second phase. 
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USING IDT73210S 

Figure 3 shows the application of the 73210s as one-deep 
read and one-deep write buffer. Four 73210s are used to 
transfer the 32-bit data and the associated four parity bits. On 
the address bus, four 73210s are used to pass the 32-bit 
physical address and the access type bits(0:1) to the main 
memory. Port B of the 7321 Os are connected to the processor 
side, and Port A of the 73210s are connected to the memory 
side. 

The read and the write data paths are explained in Figures 
4 and 5. In this design, one single set of four IDT7321 Os serve 
the function of read and write buffers. Also, a set of four 
I DT7321 Os are used to capture the addresses during read and 


write operations. The timing diagrams point out the control 
signals that resolve any conflicts in the use of these buffers. 

The control logic, described in the following sections, can 
be implemented with any PLD that matches the processor 
speed. To interface with the main memory, signals are 
defined to make a request to the main memory (MR EQ), to 
specif y a read or a write operation to the main memory (MRD, 
MWR), and a signal from the main memory to indicate the 
completion of read or write operation (CYCEND). 

The memory interface signals from the R3000 are used by 
the PAL state machine inorder to generat e contro l s to the 
buffers and t he main memory. The RdBusy, WrBusy, MREQ, 
MRD, MWR, and the clock and data output enables for the 
73210s are generated by the state machine. 



2647 drw 03 

Figure 3. Using IDT73210 as Read and Write Buffer 
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Read Operations always transparent to bypass the address bits and access 

The data path for the read operations is through register X. type bits. The POLARITY signal is held low to pass the access 
The address and the access type bits go through the latch W type bits as parity bits through the two 7321 0 on the tag bus. 
and the register Z. The lower address bits are clocked in with The low POLARITY signal to the four 7321 Os on the data bus 
SysOut. The tag bits, a long wit h the access type bits, are generates even parity on the data passing through register X. 
registered with inverted SysOut (SysClk). The latch W is 



Address Data 


2647drw04 

Figure 4. Read Data Path. One Deep Read, One Deep Write Buffers Using IDT73210 
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Write Operations 

The data path for write operations is through latch W and 
register Z. Data is clocked in the 7321 Os on the data bus with 
SysClk along wi th the ta g bits. The lower address bits are 
clocked in with SysOut. Once the address and data are 


available in the Z registers, the PAL state machine generates 
the output enable signals and presents the address and the 
data to the memory. The even parity that is generated by the 
CPU passes through the parity unit without getting modified. 



Address Data + Parity 


Figure 5. Write Data Path. One Deep Read, One Deep Write Buffers Using IDT73210 


2647 drw 05 


CONTROL LOGIC 

The control logic for the signals that control the address 
7321 Os, the data 7321 Os, and handshake signals to the main 
memory system is described with simple boolean equations. 
The 73210s are used to capture the data and addresses 
during read and write operations and to provide the system 
with one-level deep read and write data paths. Byte enable 
signals for partial word writes can be generated by extending 
these equations. In this design, block refills are supported and 
instruction streaming is assumed to be enabled. 

The control signals that are utilized by the PAL state 
machine to control the read/write buffers, and communicate 


with the main memory controller are WIP, WrBusy, RdBusy, 
M R D, MW R.MREQ and CYCEN D. The clock input to the PAL 
is inverted SysOut. In the following boolean equations the 
following notation is adopted: 


! 

OR 

AND 


Logical NOT operation. 

Indicates the corresponding signal is active low. 
Logical OR operation. 

Logical AND operation. 

To end a boolean equation. 

Registered output. 

Combinatorial output. 
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Main Memory Controls 


{ RdBusy usually stays asserted even when there is no read } 

{ operation going on. It is d easserte d when the memory system } 

{ acknowledges ( asserti ng CycE nd) that the read is finished. It } 

{ remains deasserted till MemRd gets deasserted. } 

{ It is assumed here that the memory system asserts CYCEND in } 

{ pha se 1 . RdBusy gets deasserted in phase 2, and the CPU puts } 
{ out XEn(s) from the next clock cycle. } 

{ RdBusy is not deasserted with CYCEND associated with a prior } 
{ write operation. } 

{ RdBusy is a registered output. } 


RdBusy := !( !WIP AND Me mRd AN D CYCEND) OR 
!( IRdBusy AND MemRd); 


{A single pulse request is sent out to the main memory system to } 
{ indicate that a read or write operation is coming along. It is } 

{ asserted only when a read or write operation is feasible through } 

{the one deep read and write buffers. } 

{ This is a registered output. } 


MREQ := ( !WIP AND IMemRd AND MemWr AND IMREQ) OR 
( IWIP AND MemRd AND IMREQ ); 


{ A read strobe is given out to the main memory system to } 

{ indicate a read operatio n. This s ignal is asserted while there is } 

{ no write in progress an d Mem Rd is asserted. } 

{ To support block refills MRD stays asserted with MemRd. } 

{ This is a registered output. } 


MRD ;= ( IMRD AND IWIP AND MemRd) OR 
( MRD AND MemRd); 


The control signal CYCEND is asserted by the main memory controller 
to indicate the finish of a write operation or the availability of the first word 
of the block refill in t he read b uffer. 

It is assumed that CYCEND is asserted in phase 1 , so that RdBusy can 
also be deasserted in phase 1. 


{ WIP signal is used to indicate whether the write buffer is in } 

{ the process of retiring its conte nts to the main memory. } 

{ WIPis asserted w hen MemW r is asserted and deasserted when } 
{ an acknowledge (CYCEND) from the main memory system comes } 
{ back indicating that the write is carried out. } 

{ WIP is a registered output ) 


WIP := ( IWIP AND IMemRd AND MemWr) OR 
(WIP AND ICYCEND); 


{ Write busy (WrBusy) is asserted when there is a write in } 

{ progress or when read operation is going on . During read } 

{ operations with streaming enabled, WrBusy should be } 
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{ asserted to stop any writes because there is a common data } 

{ buffer for both read and writes. } 

WrBusy = WIP OR MemRd; 

{ A write strobe is given out to the main memory system to } 

{ indicate a write operation is in progress. } 


MWR = WIP; 

Higher Address Buffer Controls 


{ Controls for 73210s that pass higher address bits (Tag 16:31) } 

{ and access type bits (AccTyp 0,1). } 

{ The path through latch W & register Z is selected by the internal } 
{ Mux. } 

SEL = 1 ; 

{ Register Z is enabled for read and write operations when there } 
{ is no contention between read and writes.L atch W is transparent. } 
{ A read operation in progres s is indicated by MemRd signal. } 

{ For write operations ABEN is enabled for one clock cycle. } 


DMemWr := MemWr; 

ABEN = ( MemRd AND !WIP AND IMemWr) OR 
( MemWr AND !WIP AND IMemRd) OR 
( ABEN AND MemWr AND IDMemWr); 


{ Allows the Access Type bits to pass through “Compliment } 

{ Even/Odd Parity” unit as parity bits without getting modified. } 

POLARITY = 0; 

{ The higher address bits along with the access ty pe bits are } 

{ clocked into the register Z with inverted SysOut. } 

CP = ! SysOut; 

{ The higher address bits along with the access type bit s are put } 

{ out to port A when there is a write in progress or while MemRd } 

{ is asserted. } 

AAOE = WIP OR MemRd; 

{ AEN always disabled for the address 7321 0. } 

AEN = 1; 

{ BOE always disabled for the address 73210. } gp| 

BOE = 1 ; gjjJ 

Lower Address Buffer Controls 

{ Controls for 73210s that pass lower address bits (AddrLo 0:15) } 
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{ The path through latch W & register Z is selected by the internal } 
{ Mux. } 

SEL = 1; 

{ Register Z is enabled for read and write operations when there } 
{ is no contention between read and writes .L atch W is transparent. } 
{ A read operation in progres s is indicated by MemRd signal. } 

{ For write operations ABEN, is enabled for one clock cycle. } 

DMemWr := MemWr; 

ABEN = ( MemRd AND !WIP AND IMemWr) OR 
( MemWr AND IWIP AND IMemRd) OR 
( ABEN AND MemWr AND IDMemWr); 

{ POLARITY is Don’t Care. } 

POLARITY = 0; 

{ The low er address bits are clocked into the register Z with } 

{ SysOut signal, because they are available in the first phase. } 

CP = SysOut; 

{ The lower address bits ar e put out to port A when there is a } 

{ write in progress or while MemRd is asserted. } 

AAOE = WlP OR MemRd; 

{ AEN always disabled for the address 73210. } 

AEN = 1; 

{ BOE always disabled for the address 73210. } 

BOE = 1 ; 

Data Buffer Controls 

{ Controls for 7321 Os that transfer data bits for reads & writes. } 

{ The path through latch W & register Z is selected by the internal } 
{ Mux to provide one-deep write buffer. } 

SEL = 1 ; 

{ Register Z is enabled for write operations when there is no } 

{ read operati on in pro gress. A read operation in progress is j 

{ indicated by MemRd signal. Latch W is transparent. } 

DMemWr := MemWr; 

DBEN = ( MemWr AND IWIP AND IMemRd) OR 
( DBEN AND MemWr AND IDMemWr); 

{ Even polarity generated by the CPU is passed through by setting } 
{POLARITY to ZERO. } 
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POLARITY = 0; 

{ The data bits along wit h the par ity bits are clocked into the } 

{ register Z with inverted SysOut. } 

CP = I SysOut; 

{ The data bits are put out to port A when there is a write in } 

{ progress. } 

DAOE = WlP; 

{ DAEN is enabled during read operations. } 

DAEN = MemRd AND IWIP; 


DBOE is enabled by XEn to read the data from the read buffer. 


TIMING DIAGRAMS 

Figures 6 through 1 1 give the timing waveforms for the one- 
deep read and one-deep write buffer described in Figure 3. 
The signals shown in these figures are described by the 
boolean equations presented earlier. In these timing diagrams, 
the signals that are generated by the PAL state machine are 
shown with a displacement in relation to their input signals. 
Also, some of the signals generated by the PAL state machine 
are registered with SysClk. The main memory interface 
sign als ge nerated by the PAL are MREQ, MRD, CYCEND, 
and M WR. The e nable signals to the address 73210s are 
ABEN, and AAOE. Th e en able sig nals to the data 7321 Os are 
DBEN , DAOE, D AEN, and DBOE. The me mory ac knowledge 
signal, CYCEND is asserted two cycles after MR EQ is asserted, 
for both read and write operations. 

Figure 6 shows read and write ope rations. The memory 
re ad ope ration starts with the MemRd signal b eing a sserted. 
A MREQ pulse is sent out to th e memor y, and MRD signal is 
asserted forthe durationthatthe MemRd signal stays asserted. 
The memory system responds to t he reques t by placing the 
data in the read buffer and asserting CYCEND. Thisdeasserts 
the RdBusy signal. RdBu sy is sampled in phase 1 by the 
processor, and it generates XEn in the next clock cycle. Since 
the one-deep read and write buffers are implement ed using 
the same buffers, during a memory read operation the WrBusy 
signal is asserted to halt any write operations. The address 
enables are asserted through out the r ead ope ration to capture 
the addr esses. For read operations, DAEN is asserted with 
MemRd signal to capture the data coming from the memory. 
The port B output enable forthe data buffers is controlled by 
XEn for reading in the data. The read latency is five clock 
cycles including the fixup cycle. 

For write operations in Figure 6 , the WrBusy signal is 
asserted as long as the write operation is in progress. This is 
indicated by WlP. It should be noticed that the RdBusy signal 
is asserted during write operations to block any read operations. 
The address enables are asserted during the write run cycle, 


and the address output enable is asserted throughout the 
write operation. The data buffer en ables are asserted in the 
same way. It should be noted that WlP is a clocked output. 
The write operation takes three cycles. 

Figure 7 shows a four word data block refill. T he RdBus y 
signal, once deasserted, remains deasserted until MemRd is 
deasserted. 

Figure 8 shows four word instruction block refill with 
streaming enabled. The instruction cache miss occured on 
the instruction II. The refill starts with the basic block 
boundary instruction 10. The processor enters fixup as the 
missed instruction is fetched. The processor streams through 
the rest of the block. 

Figure 9 shows a memory read requested by the processor 
before a previous write is retired to the main memory. The 
state machine puts out a request for the read operation only 
after the compl etion of the write opera tion, in dicated by the 
first assertion of CYCEND. The enable ABEN is not enabled 
forthe read untill the previous write is completed. 

Figure 10 shows two write operations in two consecutive 
clock cycles. Since the write buffer is one word deep, the 
second write is not absorbed by the write buffer, and the 
processor stalls untill the first write is retired to the main 
memory. In the following fixup cycle, the seco nd write is 
completed to the write buffer. The memory request MREQ for 
the second write is only generated in the fixup cycle. The data 
and the address of the second write are not captured by the 
buffers while the fir st write is in progress. In should be noted 
that deassertion of WrBusy is sampled by the processor in 
phase 1 . 

Figure 1 1 shows a write operation occuring in the middle of 
streaming. Streaming starts with instruction II. The next 
instruction 12 issues a write. Since the write busy signal is 

already asserted, instruction streaming is aborted. The 

instruction 12 is executed in the following fixup cycle. WlP is 
asserted only in the fixup cycle. The data and the address of 
the write instruction 12 are not captured during streaming. 



9.10 


20 




USING THE IDT73200 OR IDT73210 AS 
READ AND WRITE BUFFERS WITH R3000 


APPLICATION NOTE AN-65 


SysOut 
Mem Fid 
RdBusy 
MREQ 
MRD 
CYCEND 
XEn 
WiP 
MemWr 
WrBusy 
MWR 

Men 

MOE 

DBEN 

DAOE 

DMR 



Figure 6. Read, Write Operations 
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Figure 7. Data Block Refill 
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Figure 8. Instruction Streaming 
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Figure 9. Read During Write in Progress 
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Figure 10. Write During Write In Progress 
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Figure 11. Write in Streaming 
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TWO DEEP READ AND ONE DEEP WRITE This configuration of the data path uses the registers Y and Z 

IDT73210S can also be used in two-deep read and one for read operations as two-leve! deep buffers. For write 
deep write configuration. For capturing the addresses and the operations, the data is wrtten to the register X, thus providing 
access type bits, four IDT73210s are used with B ports a one-deep write buffer. The read and write data paths are 
connected to the processor. For transfering data, four shown in Figures 12 and 13. It should be noticed that even 
IDT73210s are used with A ports connected to the processor, parity is generated on the data in both the directions. 


AccTypO, 1 Tag 16:31 AdrLoO:15 Data + Parity 



Address Data 
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Figure 12. Read Data Path — Two Deep Read, One Deep Write Buffers Using IDT73210 
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Figure 13. Write Data Path — Two Deep Read, One Deep Write Buffers Using IDT73210 
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Figure 14. Read Data Path — One Deep Read, Two Deep Write Buffers Using IDT73210 


ONE DEEP READ AND TWO DEEP WRITE 

To use IDT73210s in a one deep read and two deep write 
configuration, four IDT73210s are connected to the address 
bus with B ports on the processor side. Four IDT73210S are 
connected to the data bus with B ports on the processor side 
to transmit data. The data path for the read operations is 
shown in Figure 1 4. The address and the access type bits can 
be passed through the latch W and the register Z. Data is read 
back from the memory through the register X after even parity 
is generated. 

Figure 1 5 shows the write data path. To utilize the 7321 Os 
as two deep write buffer, the addresses and the data are 


passed through registers Y and Z. These two registers 
provide the two-level deep buffering for the addresses and the 
data. If any write operations, such as writing to the registers 
of I/O devices, require only one-deep write buffer, then the 
path through the latch W and the register Z is useful for both 
data and the addresses. It should be noticed that to transfer 
access type bits in two-deep write configuration, separate 
two-level deep buffering is required. Increasing the depth of 
the write buffer to two may improve the performance 
significantly if the application executes the second store 
before the first store is absorbed by the main memory. 
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Figure 15. Write Data Path. One Deep Read, Two Deep Write Buffers Using IDT73210 

CONCLUSIONS the memory interface for R3000 based systems. Using 

IDT73210 is an ideal part for one/two-deep readA/vrite IDT73210sontheaddressbus,separatelatchesforcapturing 
buffers for R3000 applications. It is bidirectional, and speed the address low bits can be eliminated. IDT73210 also 
compatible with the existing RISC processors. It generates provides the designer two different data paths from port B to 
and checks even parity and hence reduces the parts count in pod A to be selected dynamically depending on the operation. 
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By Michael J. Miller 


INTRODUCTION 

The IDT79R3000 RISC Microprocessor is increasingly 
selected for demanding embedded tasks in applications 
ranging from avionics controllers (CAP-32) to laser printer 
engines and PBX systems where performance is critical. This 
wide range of use is possible because of the inherently flexible 
yet powerful nature of the R3000 which allows the system 
designer maximum flexibility to achieve both cost and 
performance goals. The IDT79R3001 RISController is the 
first derivative of the R3000 family and builds upon this 
flexibility by allowing the designer to create systems with 
fewercache parts or even systems without cache at all! While 
some processors incorporate an on-chip cache, it is typically 
smallerthan desired for large applications. The R3001 allows 
the designer to decide whether to have a cache and also the 
cache size. Since the R3001 includes the cache control, there 
is no overhead beyond the actual memory components. 

Basic system designs vary according to specific needs of 
the application. To be an ideal embedded control CPU, a 
processor must provide solutions that meet different sets of 
criteria. These criteria, however, vary in importance depending 
upon the particular application. In general, the criteria will fall 
into one of two broad categories: system performance or 
system cost. Performance can be measured in terms of raw 
throughput, context switch time and minimum and maximum 
latency to interrupts. System cost consists of component cost 
and parts count, which are not always the same. For avionics, 
system board space is paramount, not individual component 
cost. The IDT79R3001 RISController was defined to meet 
these requirementsforembedded systems, asweil as maintain 
software and architectural compatibility with the R3000. 
Compatibility allows all of the R3000 development tools to be 


used with the R3001 . While the changes to the R3000 may 
seem deceptively simple, they result in significant benefits to 
the R3000 family. The 79R3001 dramatically reduces cache 
memory costs and lowers the interface parts counts, as well 
as provides support for single-hierarchy memory systems to 
increase performance. 

TWO SYSTEM APPROACHES 

The RISController offers the designer two basic system 
philosophies. The first is to treat a relatively small amount of 
local synchronous memory built with SRAM as a cache to 
allow the R3001 to manage the movement of data between 
the cache and main memory. In this configuration, the local 
memory provides each word of instruction or data with a valid 
bit and an address tag of its page in main memory. If the 
fetched data is not valid or is from the wrong place, the 
processor refetches the data from main memory in an 
asynchronous handshake. The designer programs the TAG 
address width to exactly match the system requirements. For 
example, a cacheable main memory of 8Mbytes and a cache 
of 16Kbytes requires a 9-bit tag and, thus, the total required 
width is 42 bits for data, tag and valid bit. With the large index 
address in the local cache, the designer can choose to lock 
into the cache portions of the program 

The other approach is to incorporate a large amount of local 
memory and manage the loading of data and instructions 
through program control. With this approach, the width of the 
local memory is 32 or 36 bits, depending on whether or not 
parity is selected for protection. The R3001 accomplishes 
both approaches with direct control of the local memory 
through dedicated pins and, thus, avoids extra components 
and maximizes performance of the local memory. 


RISController is a trademark of Integrated Device Technology, Inc. 

C EXECUTIVE Is a registered trademark of JM I Software Consultants, Inc. 


©1990 Integrated Device Technology, Inc. 
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Figure 1. Block Diagram of Two Approaches 


2855drw01 


MAKING A SELECTION 

To select the optimal configuration, the designer must 
determine the computational requirements of the application. 
For example, many applications have a “drop dead” 
performance requirement; a processor unable to achieve this 
minimum performance standard is not a viable candidate for 
the application. Performance can be measured as the time to 
execute a task, respond to an interrupt or perform a complete 
context switch. The performance of the R3000 is demonstrated 
to be 21 VUPS in a 25MHz M/2000 system or 13 VUPs in a 
16MHz M/120 system. Using the JMI C EXECUTIVE® OS 
kernel on an M/120 provides context switches in IOjis and 
system calls in only 8ps. 

Predictable interrupt response times can be achieved by 
locking the kernel into cache. With the R3001, this is 
accomplished by using a high address line (such as AddrLo23) 
as one of the address lines for the instruction cache. In this 
way the instruction cache is divided into two halves, each half 
containing words from a specific region in memory. In this 
example, if all of the kernel code is placed in memory above 
8M and the application code is below 8M, the kernel will not be 
pushed out of the cache by the application code. Furthermore, 


if a kernel such as the “C EXECUTIVE” is used, the kernel will 
never leave the cache because it consists of only 8Kbytes 
which fit entirely into the 16Mb cache that is divided into two 
parts. 

Another key aspect of an embedded control application is 
the parts count and the board area of the design. With this in 
mind, IDT and other SRAM suppliers currently provide wider 
memories as well as integrated logic functions on-chip. For 
example, the IDT71 586 is a 4K x 1 6 SRAM with an integrated 
address latch that can be used to build a processor cache (for 
either the R3000 or 80386) with a low-parts count. Another 
new standard architecture is emerging with IDT’s new 71 222, 
which includes two interleaved banks of 4K x 1 8 SRAM in one 
device. With this approach, a designer can build a system that 
includes both instruction and data caches with minimal parts 
count. Like most other processors, the R3001 needs address 
and data buffers. If operated without cache it will provide 
performance similar to a 68020. By adding a 16Kbyte cache 
using only three parts (IDT71 586), however, the performance 
more than doubles, making a 12.5MHz system achieve 4 
VUPs running large embedded applications such as Page 
Description Interpreters. 
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Figure 2. Locking Code In the Instruction Cache 
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NO CACHES 

A "cacheless” scheme can be used in various embedded 
applications where deterministic behavior must be guaranteed, 
the application has stringent board space constraints and 
where the amount of code to be executed is limited although 
high-performance is demanded. Examples of such applications 
include communication controllers, avionics, robotics and 
switching controllers. 

These systems typically feature compact instruction and 
data spaces because the nature of theirtask is highly repetitive. 
This allows the system designer to compact the memory 
hierarchy and eliminate the need to “cache” large main memory 
in his system. Instead, the R3001 is used in anon-hierarchical 
(without cache) based system, where aH of the needed code 


and data fit in a tightly coupled synchronous memory. Such 
a system achieves the highest performance possible while 
minimizing the part count. Since all memory accesses are 
synchronous (with no cache or TLB misses), the system 
achieves the true (theoretical) performance of the RISC 
engine. At 25MHz operation, the system will achieve 23 VUPs 
with complete predictability for interrupts. 

The example shown in Figure 3 is a system which 
incorporates 1 28Kb each of instruction and data synchronous 
memory, a one deep read/write buffer interface to slower 
peripheral devices, 64Kb of EPROM for instructions and 
initialization data (configured in the asynchronous space) and 
a DUART. The instruction and data memory can also be 
expanded by using 64K x 4 or 128K x 8 memories to make 
systems with 256K and 51 2K bytes, respectively. 
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Figure 3. Low Parts Count System Using the IDT79R3001 


Slower devices, such as the boot EPROM and the I/O 
devices, reside in the “asynchronous" address space of the 
system. The R3001 divides its 4GB virtual address space into 
4 sections: 

1 . The kernel/usersegment which is both cached and mapped 
by the TLB. 

2. Kernel segment 0 which is not mapped but is cacheable. 
This is where the system's synchronous memory exists. 

3. Kernel segment 1 which is neither mapped norcacheable. 
The processor reset vector as well as the peripheral 
device are located in this space. 

4. The kernel segment 2 which, like the kernel/usersegment, 
is both cacheable and mapped by the TLB. 

The asynchronous bus is used when accesses to ksegl are 
issued by software, as well as when the system is initializing 
itself. 

The asynchronous bus supports I/O and memory devices 
with varying access speeds. In this example, the asynchronous 


address space is 256Kb and the asynchronous data bus is 8 
bits wide. The design includes 1 28Kb EPROM ( two 64K x 8 
device) and one DUART. The EPROM holds the boot code, 
initialization data and application code. It would be reasonably 
simple to scale this design to include other I/O devices as well. 

CONCLUSION 

One of the strengths of the R3000 is its ability to support 
different memory hierachies and cache configurations. IDT 
made modifications to the R3000 to create the IDT79R3001 
RISController. It retains the flexibility of the original R3000 
while meeting controller design requirements such as reduced 
parts count, small board real estate and predictable interrupt 
response. While future versions of the family will provide 
higher levels of integration such as on-board cache, there will 
always be a version which allows the designer to tailor the 
memory hierarchy. 
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Figure 4. Using the IDT79R3001 In a Non-cache Approach 
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INTRODUCTION 

The proliferation of high-speed RISC and CISC 
microprocessors has created a demand forreal-time debugging 
tools. This application note shows how a real-time logic 
tracing tool can be created using IDT71502 multifunction 
RAMs. The IDT71502 can be used as a stand-alone logic 
analyzeror as part of an embedded fault monitor and analysis 
system. Details of howto apply this system to an R3000 RISC 
microprocessor-based system are given. The discussion in 
this paper is also equally valid for use in high-speed CISC 
processor-based designs. 

The IDT71502 can be used to function either as a logic 
tracing device or as a test pattern generator. As a logic tracing 
device the IDT71 502 can record bus activity continuously and 
then be stopped on a predetermined event such as a bus 
error. This allows the activity leading up to the “event” to be 
recorded for analysis. Since the trace function is 
accommodated in a single device, embedded tracing is more 
likely to be practical. 

DESCRIPTION OF IDT71502 
MULTIFUNCTION RAM 

IDT71502 is a 4K x 1 6 multifunction RAM with an address 
set-up time of 25ns. It has a breakpoint comparator.l 6-bit 


pipeline register and an address counter. In addition, there is 
a 16-bit set-up register used to set the chip operating mode 
and to read back chip operating status conditions. It includes 
a serial control interface called the serial protocol channel 
(SPC™ ) which is available in a variety of other products from 
IDT as well. The SPC logic, as implemented in the IDT71 502, 
has one 8-bit command shift register, a command decode 
register and a 1 6-bit data shift register. The serial data shift 
register can be configured to operate in a diagnostic mode. In 
the diagnostic mode of operation, the shift registercan read all 
status conditions on the chip such asthe RAM output, pipeline 
register output, data output pin state and RAM load/read 
counter value. 

The serial protocol channel consists of a four-pin interface 
bus through which the user can access the internal registers 
of the IDT71502. The four pins are: 

(a) Serial data input pin (SDI) for sending data and 
commands to the device. 

(b) Serial data output pin (SDO) for extracting data from 
the device. 

(c) Serial clock pin (SCLK) for clocking data and commands. 

(d) Command/Data mode pin (C/D) to provide command 
or data identification to the device. 
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Figure 1. 


SPC is a trademark of Integrated Device Technology. Inc. 
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This four-bit bus can be very conveniently connected to an 
RS-232C line for direct serial communication with a computer 
and Figure 1 illustrates one scheme to achieve this. The user 
is urged to refer to the “IDT71502 FUNCTIONALITY 
DEMONSTRATION BOARD USER MANUAL" for more 
information on interfacing the I DT71 502 to the RS-232C serial 
communication line. The SPC’s eight bit command is divided 
into a four-bit command field and a four-bit register field. The 
four-bit command field is used to determine whether a read or 
a write operation will be executed. The four-bit register field 
of the command register is used to select the various internal 
registers and the external pins on which the read or write will 
take place. Thus, the four-bit command field and the four-bit 
registerfield can effectively access any internal register for a 
read or a write operation and monitor the state of the external 
pins. 

Table 1 summarizes the SPC commands, and Register 
code s and the set-up register format. When the command/ 
data line is high, commands are serially clocked through the 
SCLK into the internal command reg ister via the serial data 
input pin (SDI). When the command/data line is low, data is 
serially clocked by the SCLK into the internal data register via 
the serial data input pin (SDI). The SPC commands are 
executed whenever the C/D line transitions from a command 
mode (logic 1) to a data mode (logic 0). This device, when 
configured to operate in the trace mode, serves as a real-time 
debugging tool analogous to a logic analyzer. 
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SET-UP/STATUS REGISTER CODE 


Bit 

Name 

Operation Performed 

15 

CE 

Read Only 

14 

SOEFF 

Read Only 

13 

SOEPin 

Read Only 

12 

OE Pin 

Read Only 

11 

WE Pin 

Read Only 

10 

iNlT Pin 

Read Only 

9 

BP Compare 

Read Only 

8 

BP Pin 

Read Only 

7 

CST 

Read/Write 

6 

CSO 

Read/Write 

5 

Non-Reg High 

Read/Write 

4 

Non-Reg Low 


3 

— 

— 

2 

BC-ADDRS 

Read/Write 

1 

BC Pipelined 

Read/Write 

0 

Trace Mode 

Read/Write 
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SPC COMMAND CODES SPC REGISTER CODES 


Register Code 
(Hexadecimal) 

Read/Write 

Function 

Register 

0 

Read/Write 

RAM Counter 

1 

Read/Write 

RAM Output/Input 

2 

Read/Write 

Pipeline Register 

3 

Read/Write 

Break Mask Register 

4 

Read/Write 

Break Data Register 

5 

Read/Write 

Setup and Status Register 

6 

Read Only 

1/015 - 1/00 (Data Pins) 

7 

Read Only 

RAM Address Pins 

8-F 

Reserved 

Reserved (Unused) 


Command Code 
(Hexadecimal) 

Read/Write 

Function 

Operation 

0 

Read 

Read Register 

1 

Write 

Write Register 

2 

Read RAM 

Read RAM and 
Increment Counter 

3 

Write RAM 

Write RAM and 
Increment Counter 

4-C 

Reserved 

(Reserved NO-OP) 

D 

Write 

Stub Diagnostic 

E 

Write 

Serial Diagnostic 

F 

Reserved 

(Reserved NO-OP) 
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Table 1. 
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AN R3000-BASED SYSTEM 

A block diagram of a R3000-based system’s CPU and its 
memory interface is shown in Figure 2. It consists of the CPU 
and FPU, data and instruction caches and the read and write 
buffers connected to the CPU and the system bus. This is a 
typical configuration found in embedded or general purpose- 
type systems which use the R3000. To reduce the burden of 


thesystemdesignerwhoisinterested inusingthelDT79R3000, 
the CPU and FPU, as well as the instruction and data caches 
with the read and write buffers, are now available in a compact 
module (IDT7RS101) which can be connected to the user’s 
system bus. This approach to system design vastly reduces 
the design cycle time by shifting the design emphasis to main 
memory and I/O interfaces. 


CPU Data 
Bus 



CPU Address 
Bus 


To Main Memory 
and I/O Interface 
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Figure 2. A Generic R3000 Microprocessor Based System 


When debugging a system board based on the IDT7RS1 01 
or its equivalent, the majority of debugging is done by monitoring 
the cache to main memory interface on the main memory side 
of this interface. An embedded trace function may operate in 
the same way. This keeps the capacitance of the trace RAM 
pins out of the speed critical cache buses. If desired, the 
R3000can be operated in the uncached mode. This forces all 
accesses to main memory and allows every memory access 
of the processor to be monitored from the cache to main 
memory interface. The user wishing to operate in the uncached 
mode can do so by setting bit 1 1 of the TLB entry register to 
1 , indicating uncached mode, oroperate the software in virtual 
address space ksegl. Ksegl is kernal-mode virtual addressing 
space which is uncached and is 512 Mbytes long starting at 
virtual address 0xa000_0000. With this approach, the user 
must define instruction space and data space in the main 


memory and must provide an address decoded input to the 
IDT71502 tracing the control bus. This input will be used to 
determine whether an instruction or data related transaction 
occurred during that clock period. 

Another approach is to tie the address valid bit on the TAG 
bus to ground via a 300 Ohm resistor. This is necessary to 
prevent a direct short from occurring when the CPU is driving 
the TAG bus. Tying the address valid bit on the TAG bus to 
ground will result in invalidating the cache TAGs and cache 
misses will occur, resulting in the processor accessing main 
memory to get that information. Whenever the main memory 
is accessed to get information after a cache miss, the processor 
puts out information on the Access Type pins, indicating the 
size of the word to be transferred and that it was a cached 
reference. AccTyp(2) pin output indicates a cached reference 
when 1 and an uncached reference when 0. AccTyp(O) 
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indicatesa Data reference when 0 and an instruction reference 
when 1 . The AccTyp signals are latched using our control 
trace RAM and will determine whether an instruction or data 
transaction occurred during that clock period. 

A user wishing to implement his own cache can use the 
IDT71502 in the trace mode to monitor the cache. However, 


it should be pointed out that the timing for this part of his 
system is more stringent. The user may have to register trace 
data before clocking it into the IDT71502S to meet the 
IDT71502s set-up and hold time restrictions. 


To Main Memory 
and I/O Devices 



BSysOut 
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Figure 3. Block Diagram to Trace Instructions, Data, Instruction Addresses and Data Addresses 
on the System Bus of an R3000-based System 


DESCRIPTION OF THE MONITOR CIRCUIT 

Figure 3 shows the block diagram of an implementation of 
the monitor circuit. It is placed on the system bus between 
main memory and the write buffer of the R3000. The R3000 
uses the write through cache update policy to ensure data 
coherency. The function of the write buffer is to capture data 
and addresses output by the CPU and ensure that data is 
passed on to main memory. The read buffer is used for 
temporary storage of data during data transfers between main 
memory and the CPU. Depending on the block refill size, the 
readbuffer can be 1 , 4, 8, 1 6or32 words deep. The block refill 
size of the system is fixed during the system reset operation. 
The R3000’s CpCondO input can be set to a 0 to indicate a 
single word transfer or can be set to a 1 to indicate a block 


transfer by the external memory controller. The PAL state 
machine is used to gene rate the appropriate IDT71502 strobes 
to capture instructions, data, instruction addresses and data 
addresses. 

The IDT71502s labeled “1 ” in Figure 3 is used for capturing 
data and instruction addresses; IDT7 1 502s labeled "2” is used 
for capturing data and instructions. The IDT71502 labeled “3” 
is used to trace the control bus signals. In this application note, 
we assume a single word deep read buffer. If a system is 
designed for all possible types of data transfers (i.e bytes, half 
words, tribytes, words and block refills), our PAL equations will 
also have to change to generate the strobes necessary to 
trace these data transfers. 
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Figure 5. Main Memory Write Cycle (Single Word Write) 


TIMING ANALYSIS 

Figure 4 is the timing waveform for a single word read and 
Figure 5 is the timing waveform for a single word write. Since 
the system bus timing parameters are dependent on an 
external memory controller, Table 2 summarizesthe important 
handshaking signals needed to satisfy the protocol necessary 
to trace system bus signals. 

AddrOutEn is an input to the read buffer from the main 
memory controller. When asserted, this input will enable the 
addres s that is registered in the read buffer to the system bus. 
McRd is a read strobe that i s genera ted by the main memory 
controlle r in response to a MemRd pulse from the R3000. 
RBDEn is a main memory controller input to the read buffer 


that registers the data available on the system data bus into 
th e read b uffer. 

WrAck is an input to the write buffer from the main memory 
controller. It indicat es that it has written the word presented to 
it to main memory. RdAck is also a main mem ory controller 
output t hat is used to generate the RdBusy signal to the 
R3000. TWE is an input to the IDT71502S that latches data 
addresses, instruction addresses, data, and instructions; it is 
also an output from our PAL state machine. TCLK is the clock 
input to the IDT71502S tracing data addresses, instru ction 
addresses, data, and instructions. The signals TWE and 
TCLK are also used as inputs to the IDT71502S in order to 
trace the control bus signals. 
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Signal 

Function 

BMemRd 

The buffered memory read signal from the R3000 

BMemWr 

The buffered memory write signal from the R3000 

AddrOutEn 

Read buffer address output enable signal from the memory controller 

McRd 

Main memory read strobe from the memory controller 

RBDEn 

Read buffer data enable strobe from memory controller 

WrAck 

Write acknowledge to write buffer from memory controller 

RdAck 

Read acknowledge to read buffer from memory controller 

TWE 

Write enable input to IDT71502 from PAL state machine 

TCLK 

Clock input to IDT71502 from PAL state machine 


TIMING SPECIFICATIONS FOR THE 
IDT71502S 

tTWDS is the IDT71502 specification defined as “Trace 
Write Data Set-up Time”. The user must satisfy the following 
condition: 


tTWDS > 8ns 

tTWDH is the IDT71502 specification defined as “Trace 
Write Data Hold Time”. The user must satisfy the following 
condition: 


tTWDH > 2ns 

tTWS is the IDT71502 specification defined as “Trace Write 
Enable Set-up Time.” The user must satisfy the following 
condition: 


tTWS > 8ns 

tTWH is the IDT71502 specification defined as “Trace Write 
Enable Hold Time.” The user must satisfy the following 
condition: 

tTWH > 2ns 


CONCLUSION 


The I DT7 1 502 is a multifunction RAM that is fast enough to 
be used to trace the operation on most high-speed 
microprocessors including the IDT79R3000 RISC 
microprocessor. The 25ns speed grade can be used to trace 
full speed the operation of this processor up to 25MHz. The 
discussion in this paper focused on providing the pertinent 
information needed to construct a monitor circuit based on 
IDT71502 multifunction RAMs to trace the system bus of an 
R3000-based system. This discussion is also valid for users 
interested in using the IDT71502 RAMs in a trace mode to 
monitor system buses based on other high-speed processors. 

Microprocessor based systems are usually provided with 
software routines that are used as diagnostic tools to test 
system primary and secondary memory for failures. These 
programs also test I/O devices before the user receives a 
prompt, telling him the system as a whole is ready for service. 
This procedure is usually carried out after system reset, but 
occasionally during normal operation the system “crashes” in 
the middle of some critical task and the user has no clue as to 
what happened priorto the “crash”. The I DT71 502 multifunction 
RAMs, when operated in trace mode and mounted permanently 
on critical system paths, can serve as “black boxes”to give the 
user this very important information. This information can 
then be very conveniently retrieved via the four bit serial 
protocol channel connected to the RS-232 connector and the 
reason for the crash can be determined. 

The I DT71 502 is a multifunction RAM that has the capability 
to serve as a valuable logic monitoring tool. It contains the 
Serial Protocol Channel and a breakpoint comparator, has a 
4K x 1 6 memory space and is available with an access speed 
of 25ns. Thus, it is well-suited for use as a single chip logic 
analyzer in high-speed, high-density environs. 
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MULTIPROCESSOR SYSTEMS 


by Kelly Mass 


The IDT7MB6049 is a complete cache module for the 
I DT79R3000 RISC processor and is designed for both single- 
and multi-processor systems. It has two banks of SRAMs, 
each configured as 16K x 60, and each with address latches. 
One bank is used to cache instructions, the other to cache 
data. They share a data bus, allowing one bank to be 
accessed at a time. 

Use in multi-processor systems, is facilitated by a second 
address bus and an additional set of latches forthat bus. This 
bus is used in multi-processor applications to latch an address 
from a source other than the R3000. This allows the system 
to invalidate entries in the data cache in conjunction with the 
R3000. This is done in order to maintain cache coherency. 
The set of address latches forthe instruction cache is included 
in the module for symmetry, although normally no invalidations 
are done to the instruction cache. I nstrudion cache invalidation 
would require cache swapping, but only data cache invalidation 
is described below. 

When the system wants to invalidate an entry in the data 
cache, it forces the R3000 into an MP Stall by asserting 
CpCond(3). During the one clock cycle that it takes for the 
processor to enter the MP Stall, it is the responsibility of the 
system to disable the output of the latch which supplies the 
processor's address to the data cache, and enable the output 
of th e latch w hich s upplies the invalidate address . The module 
pins P10E(1) and P20E(1) are used for this purpose. It is 
important that they should never be activated simultaneously 
since the outputs of the l atches are tied together. The same 
applies to P10E(2) and P20E(2) for the instruction cache. 
Both address latches for the data cache are normally clocked 
by the same DCIk signal from the R3000 through the PI LE(1 ) 
and P2LE(1) pins of the 7MB6049. 

Once the processor is in MP Stall, it strobes DRd while 
CpCond(2) is unasserted, allowing the system to read the 
contents of the cache. The actual invalidation of the data 
cache entries begins when the system asserts CpCond(2) 
and provides the appropriate invalidate address. CpCo nd(2) 
causes the R3000 to output an invalid bit and strobe DWr. 
Multiple invalidations are performed by keeping CpCond(2) 
and (3) asserted, and changing the invalidate address. Note 
that the invalidate address timing must be consistent with the 
processor timing. One suggestion is that the invalidate 
address input of the module be driven by a register that is 
clocked by SysOut. __ 

The IDT7MB6049 has two chip select (CS) signals. Both 
of these should be grounded if the cache is not depth expanded. 


The four output enable (OE) and four write enable (WE) 
signals are split evenly between the data and instruction 
cache: (1-2) control the data cache, and (3-4) control the 
instruction cache. 

OE(1 -2) of the 7 MB 6049 c onnect to the DRdl and DRd2 on 
theR3000. DRdl and DRd2 are identical, and the load should 
be distribut ed e venly betw een them. Likewis e, OE (3-4) 
conne ct to ]Rd1 and IRd2, W E(1-2) con nect to DWrl and 
DWr2, and WE(3-4) connect to DWrl and DWr2. 

The convention of the pin naming of the 7MB6049 is that PI 
refers to the address from the R3000 , and that P2 refers to the 
(invalidate) address from the system. Likewise, (1) refers to 
the data cache and (2 ) refer s to th e instruction cache. As 
shown in Figure 2, PI LE(1 ) and P2LE(1 ) are typically connected 
together to DCI k since they l atch ad dresses into the two data 
cache latches. P1 LE(2) and P2LE(2) likewise are connected 
together to ICIk. P2LE(2) is not used if instruction cache 
invalidation i s not p erform ed. 

Similarly, P10E(1) and P10E(2) are typically connected 
together so that the outputs of the two R3000 a ddres s latch es 
are enabled and disabled together, while P20E(1 ) and P20E(2) 
can toget her co ntrol the output of the invalidate address 
latches. P20E(2) may be pulled continuously high if the 
instruction invalidate address latch is unused. 

The 60 data I/O pins of the module are labeled D(0) to 
D(59). Although the ordering of the data and address pins of 
a RAM is normally arbitrary and can be ignored , that is not the 
case with the 7MB6049. Because of steps taken to reduce the 
chip count and power consumption of the module, Tag(12)- 
Tag(15) of the R3000 must connect to D(36)-D(39) on the 
7MB6049, and AdrLo(12)-AdrLo(15) of the R3000 must 
connect to P1A(10)-P1A(13) on the 7MB6049. The order in 
which the other I/O pins are connected is not critical. Table 1 
shows recommended I/O pin connections between the R3000 
and 7MB6049. 


R3000 Signals 

IDT7MB6049 Signals 

Data 

Data(0) - Data(31) 

D(0) - D(31) 

Data Parity 

DataP(O) - DataP(3) 

D(32) - D(35) 

Tag 

Tag(12) - Tag(31) 

D(36) - D(55) 

Tag Parity 

TAgP(0) - TagP(2) 

D(56) - D(58) 

Tag Valid 

TagV 

D(59) 
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Table 1. Connection of Data and Tag Buses 
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R3000 

DCLK 


ICLK 


AddrLo (2-15) 

DWrl, DWr2 

DRdl, DRd2 

lRdi,TRd2 

lWri,lWr2 

DATA AND 
TAG BUSES 

\7 


IDT7MB6049 


P2A(0)-P2A(13) 

P1LE(1) 

P20E(1) 

P2LE(1) 

P20E(2) 

P1LE(2) 

PIOE(I) 

P2LE{2) 

P10E(2) 

PI A(0)-P1A(13) 


WE(1), WE(2) 


OE(1), OE(2) 


OE(3), OE(4) 


WE(3), WE(4) 

D(0)-D(59) 

CS(1), CS(2) 



INVALIDATE ADDRESS (2-15) 
INVALID ADDRESS ENABLE 

PROCESSOR ADDRESS ENABLE 


A 




J 


TO 

SYSTEM 


Figure 2. Pin Connections of the IDT7MB6049 
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1.0 GENERAL DESCRIPTION 

The IDT79R3001 is a RISC microprocessor which is used 
in a variety of applications ranging from low-end embedded 
controllers to high-end workstations. Currently, the R3001 
operates at a frequency of up to 33 MHz. This specification 
does not explain the functionality of the R3001 nor its architec- 
ture but is limited to describing the key timing parameters. For 
a more detailed description of the functionality and architec- 
tural description of the R3001 , please refer to the references 
listed at the end of this document 

This document starts with a brief description of the R3001 , 
the three-phase clock inputs, cache timings, required timings 
for SRAMs to function as cache for the R3001 , two technical 
notes explaining the factors used in the timing calculations, 
and the conclusion reached. 

2.0 FUNCTIONAL DESCRIPTION 

The IDT79R3001 is a 32-bit RISC microprocessor that is 
currently available from Integrated Device Technology, Inc. in 
two packages: the 144-pin PGA and the 172-pin ceramic 
flatpack. It has a 32-bit data bus, a 32-bit address bus that is 
divided into the low-order bits (AdrLo) and the high-order bits 
(TAG), control signals for the cache, control signals for main 
memory, and power and ground pins. The R3001 also has 
three double frequency clock inputs and one clock output 
used for interfacing the R3001 to the external world. 


3.0 THREE CLOCKS AND DELAY-LINE 
SETTINGS 

Figure 3.1 shows a block level diagram of the R3001 with 
its three clock inputs coming from a delay line. Table 3.1 
shows a summary of the delay line settings to be used for 
different operating frequencies of the R3001. Please note 
carefully that C!k2xSys Is taken ast he zero time reference 
and comes from the first tap of the delay line. The other 2x 
clocks lag Clk2xSys in time and follow it with respect to delay 
line taps. 

4.0 DERATING CALCULATIONS AND CACHE 
TIMING CONSIDERATIONS USING x4 
SRAMS 

The design of the cache subsystem for the R3001 is 
straightforward. Industry standard static RAMs function as 
cache. This chapter discusses the methodology used to 
calculate the critical timing parameters for a static RAM so that 
it can function as cache forthe R300 1 . This chapter examines 
the timings for a 16 MHz, 20 MHz, 25 MHz, and 33 MHz 
R3001. The timing equations derived take into account the 
effect of capacitive loading on the bus. The derating factors 
are calculated based on certain assumptions. These assump- 
tions are detailed in this chapter and the derating factors 
calculated. The timing equations are then discussed. At the 
end of this chapter a table containing the SRAM timings (for 
different operating frequency of the R3001) is included. 



Figure 3.1. Three-Phase Clock Input to the R3001 


Parameter 

16 MHz 

20 MHz 

25 MHz 


Clk2xSys 

0 

0 

0 

0 

Clk2xSmp 

6 

6 

6 

4.5 

Clk2xPhi 

16 

14 

12 

9 


Table 3.1. Delay Line Settings for R3001 Operating at Different Frequencies 
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Figure 4.1. Block Level Diagram of a Cache Subsystem with the R3001 Using IDT7198 16K x 4 to 
Function as Cache (the R3010 Is not shown In this figure) 


4.1 Device Capacitance 

Figure 4.1 shows a typical R3001 based system. The 
cache comprises of fast 16 K x 4 static RAMs e.g., the 
IDT7198. The AdrLo bus of the R3001 goes through a high- 
speed transparent latch: the FCT373. It also goes through a 
latch which is used to address the main memory. All the 
devices have an input and an output capacitance. In addition, 
each device is capable of driving a certain load. These 
parameters: the input capacitance, the output capacitance, 
and the load capacitance are given in Table 4.1 . 

The cache format of the R300 1 comprises of 48 bits: 32 bits 
of data, 15 bits of tag and, a valid bit. With this requirement, 
it is clear that for the instruction cache, 12 IDT7198S (16K x 4 
SRAMs) are needed. The data cache has the same format. 
This means that there are a total of 24 SRAM devices for the 
cache. 


Device 

U of 

Devices 

Capacitance 

Total 

Capacitance 

R3000 

1 

Cin =10 pF 

10 pF 

IDT7198 

2 

Cout = 7 pF 

14 pF 

IDT374A 

1 

Cout = 12 pF 

12 pF 

IDT823B 

1 

Cin = 10 pF 

10 pF 


Table 4.1. Capacitances of the Various Devices in a Typical 
R3001 System 
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4.2 Assumptions for Surface Mount Layout Design With 
'x4' SRAMs 

In the following sections, certain assumptions have been 
made while calculating the derating factors. These are as 
follows: 

1) The trace has a capacitance of 2 pF/inch. 

2) The speed of light is 2 ns/foot in epoxy. 

3) The R3001 speeds are specified with a loading of 25pF. 
For every additional 25 pF, there is a delay of 1 ns. 

Note that the cache control signals are specified with a 50pF 
load and derate 1 ns/25pF after that. 

4 The distances between the R3001 and the latches are 
approximately 1 inch each. 

5) The distances between the R3001 and the RAMs ap proxi- 
mately 4 inches each. 

6) In all of the assumption, it is assumed that a surface mount 
package is used. 

Figure 4.2 shows a brief mechanical layout of an R3001 
board. 



Assume read and write buffer underneath the board 

Figure 4.2. Surface Mount Board of an R3001 System with Cache 
and Main Memory Interface and Approximate Distances Between the 
Various Devices 


4.2.1 Address Bus Derating Calculations 

For the system shown in Figure 4.1 each address bit is 
connected to five latches: one going to the main memory 
interface buffer, two to the instruction cache and tag memory, 
and two to the data cache and tag memory respectively. The 
latches in turn are connected to the address pins on the static 
RAM. Figure 4.3 shows all the devices that each address bit 
is connected to. 


Bit A2 



Figure 4.3. Block Diagram Showing Various Devices Connected 
to One Address Bit 


Trace length from the CPU to the latch = 4 inches (4.1) 
Capacitance of the trace, Ctrace =4X2 pF/inch = 8 pF (4.2) 
Input capacitance of the 373 latch = 10 pF (4.3) 

As each address bit is connected to five latches (FCT373), 
Total input capacitance due to 5 devices, 

C373in = 5 X 1 0 = 50 pF (4.4) 

Total capacitive load = Ctrace + C373in = 8 + 50 = 58 pF(4.5) 
The rated R3001 load, CL(R300i) = 25 pF (4.6) 

From Eq. (4.5) and Eq. (4.6), Extra capacitive loading for 
the R3001 = 58 - 25 = 33 pF (4.7) 

Let us now examine the capacitive loading between the 
latches and the RAM. 


Path length from latches (373s) to RAM (7198s) = 3" (4.8) 
Trace capacitance from latch to RAM = 3X2 pF/in = 6pF 


(4.9) 

Input capacitance of the RAM = 5 pF (4.10) 

Each output from the latch is connected to eight RAM devices. 
Load due to 8 devices = 6 X 5 = 30 pF (4.1 1 ) 

Total capacitance = 30 + 6 = 36 pF (4.12) 

The rated '373 load = 50 pF (4.13) 


From Eq. (4.12) and Eq. (4.13) it can be seen that there is 
no delay due to the capacitive load between the latch and the 
RAM. 

However, there is a delay due to the capacitive load 
between the R3001 and the latch. This delay can be calcu- 
lated as follows: 


For every extra 25 pF of load, there is a delay of Ins (4.14) 
From 4.7 and 4.14, delay due to the capacitive load = 33/25 
= 1.32 ns (4.15) 

The speed of light = 2 ns/foot (4.1 6) 

For a maximum path length of 5", delay = 5712” X 2 = 0.8 ns 

(4.17) 


From Eq. (4.15) and Eq. (4.17), 

Total propagation delay for the address bus, AdrLo d = 1.32 + 
0.8 =2 ns (4.18) 


4.2.2 Data Bus Derating Calculations 


The derating calculations for the data path are similar to 
those done for the address path. The data bus is connected 
to the floating point unit (R3010), the instruction cache 
(IDT7198), the data cache (IDT7198), a read register 
(FCT374A), and a write register (FCT823). This is shown in 
Figure 4.4. Two cases must be considered: a data store and 
a data fetch. Both are discussed. 
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Bit DO 



Figure 4.4. Block Diagram Showing Various Devices Connected toOne Data Bit 


4.2.2.1 Data Store (R3001 CPU Outputs Data): 

Each data bit is connected to two RAMs (7198s) — one for 
instruction and one for data. 


The path length for the data bus = 5" (4.18) 

Trace capacitance for the data bus = 5X2 pF/in=10pF 

(4.19) 


Capacitive loading due to devices, 


Cdevices = 2 X CRAMin + C374in + C823 + CR3010 (4.20) 

Cdevices = 2x7 + 12 + 10 + 10 = 46 pF (4.21) 

Total capacitive load = Ctrace + Cdevices = 46 + 1 0 = 56 pF 

(4.22) 

Propagation delay due to speed of light = 


571 2" x 2 = 0.8 ns (4.23) 

Delay due to capacitive load = (56 -25 ) /25 = 1 .24 ns 

(4.24) 

From Eq. (4.23) and Eq. (4.24), 

Total propagation delay on a store = 1 .24 + 0.8 = 2 ns 

(4.25) 


4.2.2.2 Load (RAM Provides Data) 

Since the trace length is the same, Ctrace = 10 pF (4.26) 
Capacitive load due to devices, 

Cdevices = CR3001 + CR3010 + CRAMin + C374in + C823 
Cdevices = 10 + 10 + 12 + 10 + 7 = 49 pF (4.27) 

Total capacitance = Ctrace + Cdevices = 1 0 + 49 = 59 pF 
The RAM rated drive is 30 pF. 

Extra load = Total capacitance - RAM rated drive = 

59 - 30 = 29 pF (4.28) 


Propagation delay due to capacitive load = 29/25 = 1.16 ns 

(4.29) 

Propagation delay due to the path length = 0.8 ns (4.30) 
Total propagation delay = 1.16 + 0.8 = 2 ns (4.31) 

4.2.3 Read and Write Control Derating Calculations 

The effect of the capacitance on thecontrolsignalsfromthe 
R3001 processor to the caches and the memory interface is 
cons idere d here. T he co ntrol signals on the R3001 are the 
IRd, DRd, IWr, and DWr which control the instruction cache 
read, data cache read, instruction cache write, and data cache 
write respectively. The read and write control signals are 
connected to the output enable (OE), and write enable (WE) of 
the instruction and data cache, respectively. Assuming the 
use of a 16 K x 4 IDT7198 static RAM, each control signal is 
connected to 8 such static RAMs. 


Number of devices (SRAM) connected to each 
control line =12 (4.32) 

Input capacitance of each device (SRAM) = 5 pF (4.33) 
Total load capacitance = 5 x 12 = 60 pF (4.34) 

Path length = 5" (4.35) 

Trace Capacitance = 5x2 pF/in = 1 0 pF = 1 0 pF (4.36) 
Total capacitance = 60+ 1 0 = 70 pF (4.37) 

Extra capacitive load = 70 - 50 = 20 pF (4.38) 

Propagation delay due to capacitive load = 1 ns (4.39) 
Propagation delay due to the trace length = 0.8 ns (4.40) 
Total propagation delay = 1 + 0.8 * 2 ns (4.41 ) 
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Figure 4.5. Board Layout for a Through-Hole Design of an R3001 Cache Subsystem 


4.3 Assumptions for Through-Hole Layout Design 
Using x4 SRAMs 

In this section, the deratings are calculated for a through 
hole design. Figure 4.5 shows an example of the layout of a 
through hole design. This layout corresponds to an demon- 
stration board used extensively at IDT. The data trace lengths 
are 10 inches and the address trace lengths are 9 inches. 

4.3.1 Address Derating Calculations 

For the system shown in Figure 4.3, the number of devices 
connected to the R3001 is the same. 

Trace length from the CPU to the latch = 9 (4.42) 

Trace capacitance = Ctrace = 9 x 2 = 18 pF (4.43) 

Input capacitance of the latches = 1 0 pF (4.44) 

Total capacitance = 5 x C373 + Ctrace = 5x10 + 18 = 68 pF 

(4.45) 

Extra load on the R3001 = Cl = 68 - 25 = 43 pF (4.46) 
Since the rated 373 load is 50 pF, there is no derating factor 
between the FCT373 and the RAMs. 

Therefore the derating is between the R3001 and the latches. 
Delay due to capacitance = 43/25 = 1 .75 ns (4.47) 

Propagation delay due to the trace length = 9/12 x 2 = 1 .5 
ns (4.48) 

Total derating on the address bus = 1 .75 + 1 .5 = 3 ns (4.49) 

4.3.2 Derating on the Data Bus 

As in section 4.2.2, the derating for the data bus is calcu- 
lated for two cases: i) an instruction fetch, and ii) data store. 


4.3.2.1 Data Store (R3001 CPU Outputs Data) 

Each data bit is connected to two RAMs (7198s) - one for 
instruction and one for data. 

The path length for the data bus = 10" (4.50) 

Trace capacitance for the data bus = 10x2 pF/in = 20 pF 

(4.51) 

Capacitive loading due to devices, 

Cdevices = 2 X CRAMin + C374in + C823 + CR3010 (4.52) 

Cdevices = 2x7 + 12 + 10 + 10 = 46 pF (4.53) 

Total capacitive load = Ctrace + Cdevices =20 + 46 =66 pF 

(4.54) 

Propagation delay due to speed of light = 1 071 2” x 2 = 1 .6 ns 

(4.55) 

Delay due to capacitive load = (66 -25 ) /25 = 1 .55 ns (4.56) 
From Eq. (4.23) and Eq. (4.24), 

Total propagation delay on a store = 1 .54 + 1 .67 = 3 ns 

(4.57) 

4. 3. 2.2 Load (RAM Provides Data) 

Since the trace length is the same, Ctrace = 20 pF (4.58) 
Capacitive load due to devices, 

Cdevices = CR3001 + CR3010 + CRAMin + C374in + C823 
Cdevices = 10 + 10 + 12 + 10 + 7 = 49 pF (4.59) 

Total capacitance = Ctrace + Cdevices = 20 + 49 = 69 pF 
The RAM rated drive is 30 pF. 

Extra load = Total capacitance - RAM rated drive = 

69-30 = 39 pF (4.60) 

Propagation delay due to capacitive load = 39/30 = 1.3 ns 

(4.61) 

Propagation delay due to the path length = 1 .6 ns (4.62) 
Total propagation delay = 1 .3 + 1 .6 = 3 ns (4.63) 
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4.3.3 Read and Write Control Deratings 

For a through hole design, the effect of derating on the 
control signals will be more. This section calculates that 
effect. The trace length from the CPU to the RAMs is 9 inches 
for the layout shown in Figure 4.5. Each control signal is 
connected to 8 devices. 


Number of RAM devices connected to each 
control signal = 12 (4.64) 

Input capacitance of each RAM = 5 pF (4.65) 

Total load capacitance = 1 2 x 5 = 60 pF (4.66) 

The trace length = 9" (4.67) 

Trace capacitance = 9" x 2 pF/inch = 18 pF (4.68) 

Total load capacitance = 60 + 18 = 78 pF (4.69) 

Extra load = 78 - 25 = 53 pF (4.70) 

Derating due to capacitive load = 53 / 50 = 1 .6 ns (4.71 ) 
Propagation delay due to trace length = 

9/12x2 ns/foot = 1.6 ns (4.72) 

Total derating = 1.6 + 1.6 = 3 ns (4.73) 


4.4 Timing Equations for Cache Design 

This section deals with the timing equations that enable us 
to determine the critical timing requirements of the static RAM 
that will be used as cache. These equations are based on the 


use of static RAMs (without built-in latches) as cache 
RAMs. The superscript 'd' in the following equations denote 
the deratings to be taken into account. The static RAM chosen 
for illustration here is a 16K x 4 IDT7198. The board Is 
assumed to be surface mount for all speeds of the R3001 
except for the 16 MHz speed grade. The deratings for the 
surface mount board is 2 ns and that for a through hole board 
(which is used for the 1 6 MHz R3001 ) is 3 ns. The deratings 
were derived from certain assumptions. The explanation and 
the methodology used is explained in the previous sections. In 
the following, a generalized equation is given followed by the 
timing requirementsfordifferentfrequenciesof the R3001 . All 
calculations are based on the R3001 specifications forthe four 
speed versions (1 6, 20, 25, and 33 MHz), which are found in 
the IDT data sheets. 

Figures 4.6, 4.7, 4.8, and 4.9 show the timing diagrams of 
the R3001 when it is doing a data store followed by an 
instruction fetch. This is the worst case example and is 
chosen to determine the SRAM parameter requirements. 
Figure 4.6 shows the timing diagrams for an R3001 operating 
at 1 6 MHz. Figures 4.7, 4.8, and 4.9 show the timing diagrams 
for an R3001 operating at 20 MHz, 25 MHz and 33 MHz 
respectively. The encircled numbers represent the equations 
presented in section 4.4. The timing diagram in conjunction 
with the equations are used to arrive at determining the timing 
requirements. 
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Figure 4.6. Cache Timing Diagram for a 16 MHz R3001 
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Figure 4.7. Cache Timing Diagram for a 20 MHz R3001 
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Figure 4.8. Cache Timing Diagram for a 25 MHz R3001 
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Figure 4.9. Cache Timing Diagram for a 33 MHz R3001 
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The following equations are used to determine the timing 
parameters for the static RAM so that they can function as 
cache for different operating frequencies of the R3001 . The 
numbers at the left correspond to the encircled numbers in the 
timing diagrams. Equations 9 and 1 0 are not shown in the 
timing diagram but are included for completeness. The 
equations also use some R3001 parameters. These are listed 
in Table 4.2. 

(1) Internal Sample to Phase Delay 

This is the time that the processor needs to sample the 
incoming data. Typically, for the R3001 , tsmp > 5. 

(2) RAM Address Access Time 

This equation is used to determine the Address Access 
time parameter requirements of the static RAM. From 
the timing diagram of Figure 4.9, it is easily calculated. 
As an example, let us calculate the address access time 
for a 33 MHz R3001 . The total cycle time for a 33 MHz 
R3001 is 30 ns. If the processor’s sample time require- 
ment is met the time remaining in the cycle is 24 ns. This 
time the data has to be presented to the processor. The 
processor requires a data setup time of 4 ns. There is 
also a propagation delay through the latch for the 
address bus. For the 33 MHz part, a fast FCT373C is 
used which has a maximum propagation delay of 4.7 ns 
(See Table 4.3). The derating factors due to the 
capacitance and the trace length also have to be taken 
into account. Using all these factors, the equation is: 

tRAMAA < tcyc - tsmp - tDS - t373PD - tAdrLo d -tRAMAA d 

16 MHz R3001 : tRAMAA < 60 - 1 0 - 9 - 5.2 - 3 - 3 
tRAMAA < 29.8 

20 MHz R3001 : tRAMAA <50-8-8-5.2-2-2 
tRAMAA < 24.8 

25 MHz R3001 : tRAMAA <40-6-6-5.2-2-2 
tRAMAA < 18.8 

33 MHz R3001 : tRAMAA < 30 - 4.5 - 4.5 - 4.7 - 2 - 2 
tRAMAA < 12.3 

(3) Cache Enable to Sample 

This equation is used to determine the system output 
enable(tOES) requirements of the cache RAM. This 
should meet the processor’s setup specification. The 
output enable time (tOE) specifications for the RAM is 
tested for a voltage change of 200 mV (a fall from 1 .732 
V to 1 .532 V for IDT RAMs). For a system, however, the 
voltage falls from approximately 3.3 V to 1 .5 V. This fall 
time is usually a nanosecond. Therefore the RAM 


specifications should take this system factor into 
consideration and specify the output enable time at 
least one nanosecond lower than the calculated 
timings. 

tOES < tcyc/2 - tRD d - tDS - tsys-smp + tsys-rd - tOES d 

1 6 MHz R3001 : tOES <30-3-9-10 + 10-3 
tOES < 15 

20 MHz R3001 : tOES <25-2-8-8 + 8- 2 
tOES < 13 

25 MHz R3001: tOES <20-2-6-6 + 6- 2 
tOES < 1 0 

33 MHz R3001 : tOES <15-2-4-4.5 + 4.5-2 
tOES < 7 

(4) Minimum Read Pulse Width 

This timing requirement guarantees that the read pulse 
width generated by the processor is at least as long as 
the cache RAM output-enable time. 

tOES < tcyc/2 - tsys-r d - tOES d 

1 6 MHz R3001 : tOES <30-10-3 
tOES <17 

20 MHz R3001 : tOES <25-8-2 
tOES < 15 

25 MHz R3001 : tOES <20-6-2 
tOES < 12 

33 MHz R3001 : tOES <15-4.5-2 
tOES < 8.5 

(5) Read-Write 1-Cache Data Bus Contention 

This timing requirement ensures that the RAM output is 
tristated soon enough after the instruction read signal 
goes high. In the worst case, when the processor 
performs a store operation, no data contention occurs. 

tRAMHZ < tsys - tRd d + DEn 

16 MHz R3001 : tRAMHZ < 16 - 3 + (-2.5) 
tRAMHZ < 10.5 

20 MHz R3001 : tRAMHZ < 14 - 2 + (-2) 
tRAMHZ < 10 

25 MHz R3001 : tRAMHZ < 12 - 2 + (-1 .5) 
tRAMHZ < 8.5 

33 MHz R3001 : tRAMHZ < 9 - 2 + (-1) 
tRAMHZ < 6 
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(6) Processor Data-Setup to End of Write 

This enables the designer to determine whether the 
cache RAMs have adequate data setup time when the 
processor does a store operation. In the equation, the 
minimum derating is used on the write line i.e., twr d 
because that is the worst case assumption. 

tRAMDS < tcyc/2 - tsys-smp - tDVal - tDVal d - tWr d 

16 MHz R3001: tRAMDS <30 - 10 - 3 - 3 - (-2) 
tRAMDS <16 

20 MHz R3001 : tRAMDS < 25 - 8 - 3 - 2 - (-1) 
tRAMDS < 13 

25 MHz R3001 : tRAMDS < 20 - 6 - 2 - 2 - (-1) 
tRAMDS < 1 1 

33 MHz R3001 : tRAMDS < 15 - 4.5 - 2 - 2 - (-1 ) 
tRAMDS < 7.5 


(7) Data Hold from End of Write 

This parameter requirement guarantees that the data 
hold from end of write of the cache RAM is met when the 
processor or the read buffer is writing to the RAMs. 

tRAMHD 5 tRAMLZ 

16 MHz R3001 : tRAMHD < 2 

20 MHz R3001 : tRAMHD < 2 

25 MHz R3001 : tRAMHD < 2 

33 MHz R3001 : tRAMHD < 2 


(8) Data Setup to SysClk 

This timing parameter ensures that the setup time into 
an external register (for the main memory interface) is 
sufficient enough for when the processor is doing a 
store. The data is cloc ked in th e register on the rising 
edge of the buffered SysOut (through an inverting 
FCT240A). In this equation, tsys(min) d is used to insure 
worst case calculations. 

tsetupSys < tcyc/2 - tsys - tDVal - tDVal d + tsys d +t240PDmin 

16 MHz R3001 : tSetupSys < 30-16-3-3 + 2 + 1.5 
tSetupSys < 1 1 .5 


20 MHz R3001 : tSetupSys < 25 - 1 4 - 3 - 2 + 1 + 1 .5 
tSetupSys <8.5 

25 MHz R3001 : tsetupSys < 20 - 12 - 2 - 2 + 1 + 1 .5 
tSetupSys <6.5 

33 MHz R3001 : tsetupSys < 15 - 9 - 2 - 2 + 1 + 1 .5 
tSetupSys <4.5 


(9) Data Hold from SysClk 

This timing parameter is to guarantee that the hold time 
specification for an external register is met on a proces- 
sor store. In this equation the minimum value of tRD d is 
taken to insure worst case numbers. 

tHoldSys < tsys-rd d - tsys d - t240PDmax + tRAMLZ + tRd d 

1 6 MHz R3001 : tHoldSys < 6 - 2 - 4.8 + 2 + 1 
tHoldSys < 2.2 

20 MHz R3001 : tHoldSys <6-1 - 4.8 + 2 + 1 
tHoldSys <3.2 

25 MHz R3001 : tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys <3.2 

33 MHz R3001 : tHoldSys < 4.5 - 1 - 4.8 + 2 + 1 
tHoldSys < 1 .9 


(10) Address Setup to End of Write 

This equation enables us to determine the timing re- 
quirement for the RAM so that the address set up time 
is sufficient before the trailing edge of the write pulse. 

tRAMAW < tcyc - tsmp-sys - tAdrLo d - t373PD + tWr' 1 

1 6 MHz R3001 : tRAMAW < 60-10-3-5.2 + 3 
tRAMAW < 44.8 

20 MHz R3001 : tRAMAW <50-8-2-5.2 + 2 
tRAMAW < 36.8 

25 MHz R3001 : tRAMAW <40-6-2-5.2 + 2 
tRAMAW < 28.8 

33 MHz R3001 : tRAMAW <30-4.5-2-4.7 + 2 
tRAMAW < 20.8 
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(11) Write Hold Pulse-Width 

This requirement guarantees that the cache RAMs 
minimum write pulse width specification is met. 

tFlAMPW < tcyc/2 - tWrDly 

16 MHz R3001 : tRAMPW <30-5 

tRAMPW < 25 

20 MHz R3001 : tRAMPW <25-4 

tRAMPW < 21 

25 MHz R3001 : tRAMPW <20-3 

tRAMPW < 17 

33 MHz R3001 : tRAMPW <15-2 

tRAMPW <13 


(12) Write Recovery Time 

The write recovery time is the time between the write 
pulse going inactive and the change in address. This 
characteristic is usually specified by the SRAM manu- 
facturer and is typically zero. This parameter is impor- 
tant in the R3001 cache interface and care must be 
taken to choose the proper part to prevent race condi- 
tions. In the R3001 cache design using the IDT7198 
1 6 K x 4 RAM, the latch enable is controlled by ICIk/DCIk 
and the write enable on the RAM is controlled by IWr/ 
DWr. The timing diagram shows the relationship be- 
tween the two clocks and the parameter Twr. Timing 
calculations below show that the write recovery speci- 
fications are not violated. 

Derating calculations for DCIk and DWr 

To calcula te the effect of derating on the control signals 
DCIk and DWr, the following assumptions have been 
made. 

1) The pin to pin variation on an R3001 device is 15 % 

for a 50 pF load. Under the maximum case , the 
deratings will vary from 1.7 to 2 ns for DCIk and DWr. 
Under the minimum case the deratings will vary from 
0.58 to 0.625 ns. 

2) The trace length for the DWr signal is 6 inches. 

3) The trace length for the DCIk signal is 2 inches. 

4) The trace length of the address bus to the RAM is 4 
inches. 

5) Each ICIk control signal is connected to four FCT373 
devices. 



Figure 4.10a. Circuit Showing IWr and ICIk Signals to Latch and 
SRAM 

The input and ouput capacitances for the R3001, 
IDT7198, and FCT373 can be obtained from Table 4.1. 
Figure 4. 1 Paisa simple circuit showing the connections 
of ICIk and IWr from the R3001 to the latch enable (LE) 
on the FCT373 device and Write Enable (WE) on the 
static RAM respectively. Figure 4.10b shows the tWR 
timings with respect to the data cache in an R3001 
based system. 



Figure 4.10b. Write Recovery Timing 


To prove that the Twr parameter is not violated, the 
calculations are done as follows. The derating effects 
on th e DCIk and AdrLo signal should exceed that of the 
DWr signal. The calculations are similar to the derating 
calculations in previous sections. The minimum propa- 
gation delay through the latch is considered. The 
derating on the DCIk sig nal c oming out of the 79R3001 
is lesser than that of the DWr signal . The reverse case 
is superfluous and in fact makes the situation better. 
The minimum and worst case derating effects on the 
same79R3001 have been shown. This is because the 
write recovery time parameter must not be violated over 
the entire operating range. 
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Capacitive Derations: 

(R3001 variations 15% 1.7ns - 2ns ) 

(( Cdriver + Cload + Ctrace) - Crated)/25 * (Min Or 
Max) = CLD 

Icik ((10 + 4* 10 + 2*2)- 25)/25 * 1.7 = 1.97ns 
Iwr ((10 + 8*7 + 6*2)- 25)/25 * 2 = 4.24ns 
RamAddr ((12 +8* 7 + 4*2)- 50)/25 * 0.5 = 0.52ns 

Calculations : 

Race path 1 : Iclkmin + Tpd(le) + RamAdrmin = 
(1.97 + 2 + 0.52) = 4.49 
Race path 2 : Iwrmax = 4.24 

Pathl - Path2 > Twr 

4.49 - 4.24 > 0 

Capacitive Derations: 

(R3001 variations 15% 0.58ns - 0.625ns ) 

(( Cdriver + Cload + Ctrace) - Crated)/25 * (Min Or 
Max) = CLD 

Icik ((10 + 4* 10 + 2* 2)- 25)/25 * 0.58 = 0.58ns 
Iwr ((1 0 + 8 * 7 + 6 * 2) - 25)/25 * 0.625 = 1 .325ns 
RamAddr ((12 + 8*7 + 4*2)- 50)/25 * 0.5 = 0.52ns 


Calculations: 

Race path 1 : Iclkmin + Tpd(le) + RamAdrmin = 

(0.58 +2 + 0.52) = 3.1ns 
Race path 2 : Iwrmax = 1 .325 

Pathl - Path2 > Twr 

3.1 - 1.325 = 1.775 >0 

From the above calculations and the RAM timing tables 4.3 
and 4.4, it can be seen that the data setup to the processor is 
met. The output enable of the RAM which is controlled by IRd 
goes high and the RAM output starts to go tri-state. From the 
figure, the reader may correctly question whether the hold 
time requirements of the R3001 are met. It is indeed met by 
the capacitance on the bus and also due to the fact that CMOS 
devices are being used. The technical note entitled “Meeting 
Bus Hold for the R3001” gives a more detailed explanation. 

Table 4.3 gives the timing data sheet for a typical SRAM 
device. The timing parameters correspond to a particular 
RAM configuration. Other RAM devices may have different 
timings for some of the parameters. However, there are 
certain timings that must be met. These critical parameters 
are listed in Table 4.4 and the unlisted parameters may vary 
a bit from device to device. 
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AC ELECTRICAL CHARACTERISTICS— 
COMMERCIAL TEMPERATURE RANGE 


SYMBOL 

PARAMETER 

| Clock 

TCkHigh 

Input Clock High <2) 

TCkLow 

Input Clock Low (2) 

TCkP 

Input Clock Period* 5 * 


Clk2xSys to Clk2xSmp/Rd (5) 

Clk2xSmp/Rd to C!k2xPhi (5) 


TEST CONDITIONS 


Transition < 5ns 


Transition < 5ns 


16 MHz 
MIN. MAX. 


TDen 

Data Enable (3) 

TDDis 

Data Disable (3) 

TDVal 

Data Vaild 

TWrDly 

Write Delay 

Tds 

Data Set-Up 

Tdh 

Data Hold 

Tcbs 

CpBusy Set-Up 

Tcbh 

CpBusy Hold 

TAcTy 

Access Type [1 :0] 

TAT2 

Access Type [2] 

TMWr 

Memory Write 

TExc 

Exception 


Stall Operation 


Reset Initialization 



Load = 25pF 


Load = 25pF 


Load = 25pF 


Load = 25pF 


Load = 25pF 


Load = 25pF 


TRST 

Reset Pulse Width 

Trsttag 

Reset Pulse Width, 

Pull-downs on Tag 

Capacitive Load Deration 

Cld 

Load Derate 


12.5 — 

10 — 

12.5 — 

10 — 

30 500 

25 500 

0 Tcyc/4 

0 Tcyc/4 

9 Tcyc/4 

7 Tcyc/4 


25 MHz 

MIN. MAX. UNIT 


8 — 


8 — 


20 500 I ns 


0 Tcyc/4 ns 


Tcyc/4 ns 


TSAVal 

Address Valid 

Load = 25pF 

TSAcTy 

Address Type 

Load = 25pF 

TMRdl 

Memory Read Initiate 

Load = 25pF 

TMRd 

Read Terminate 

Load = 25pF 

Tstl 

Run Terminate 

Load = 25pF 

TRun 

Run Initiate 

Load = 25pF 

TSMWr 

Memory Write 

Load = 25pF 

TSEc 

Exception Valid 

Load = 25pF 

TSEc 

DMA Drive On 

Load = 25pF 

TSEc 

DMA Drive Off 

Load = 25pF 



17 

2 

15 

2 



15 

3 

15 

3 



6 — 

6 — 

6 

Tcyc 

140 — 

140 — 

140 — 

M’S 


0.5 1 ns/25pF 


NOTES: 

1 . All timings are referenced to 1 ,5V 

2. The clock parameters apply to all three 2x clocks: Clk2xSys, Clk2xSmp/Rd and Clk2xPhi. 

3. This parameteris guaranteed by design. 

4. These parameters are illustrated in detail in the "IDT79R3001 Hardware Interface Guide". 

5. Tcyc is one CPU clock cycle (2 cycles of a 2x clock). 

6. With the exception of Run, no two signals of a given device will derate by a difference greater than 15%. 


Table 4.2. R3001 AC Specificatlons.*PLL: Phase Locked Loops 
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READ CYCLE TIMING SPECIFICATIONS 



16.7 

MHz 

20.0 

MHz 

25.0 

MHz 

33.0 

MHz 

Parameter 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

tRC 

* 

— 

* 

— 

* 

— 

* 

— 

tAA 

— 

29.8 

— 

24.8 

— 

18.8 

— 

12.3 

tACSI 

— 

* 

— 

* 

— 

* 

— 

* 

tCLZI 

* 

— 

* 

— 

* 

— 

* 

— 

tOES 

— 

15 

— 

13 

— 

10 

• — 

7 

tOLZ 

* 

— 

* 

— 

* 

— 

‘ 

— 

tCHZ1,2 

— 

* 

— 

* 

— 

* 

— 

* 

tOHZ 

— 

10.5 

— 

10 

— 

8.5 

— 

6 

tOH 

* 

— 

* 

— 

* 

— 

* 

— 

tPU 

* 

— 

* 

— 

* 

— 

* 

— 

tPD 

— 

* 

— 

* 

— 

* 

— 

* 


WRITE CYCLE TIMING SPECIFICATIONS 



16.7 MHz 

20.0 MHz 

25.0 MHz 

33.0 MHz 

Parameter 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

Min. 

Max. 

twc 

* 

— 

* 

— 

* 

— 

* — 

tcwi 

* 

— 

* 

— 

* 

— 

* 

— 

tAW 

44.8 

— 

36.8 

— 

28.8 

— 

19.8 

— 

tAS 

* 

— 

* 

— 

* 

— 

* 

— 

tWP 

25 

— 

21 

— 

17 

— 

13 

— 

tWRI 

0 

— 

0 

— 

0 

— 

0 

— 

tWR2 

0 

— 

0 

— 

0 

— 

0 

— 

tWHZ 

-- 

* 

-- 

* 

-- 

* 

-- 

* 

tDW 

16 

— 

13 

— 

11 

— 

7 — 

tDH 

* 

— 

* 

— 

* 

— 

* 

— 

tow 

* 

— 

* 

— 

* 

— 

* 

— 


NOTE: 


Table 4.4. Static RAM Parameters to Work as Cache with the R3001 

All the parameters shown are the most allowable for maximum and minimum, respectively. Numbers not shown are not critical for the 
R3001 application. 
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Parameter 

Load 

Symbol 

Min. 

Max. 

FCT373A Propagation Delay 

50 

t373 PD 

- 

5.2 

FCT373A Latch Enable Delay 

50 

t373 LE 

2 

8.5 

FCT373A Latch Enable Hold 

50 

t373 Hid 

1.8 

- 

FCT240A Propagation Delay 

50 

t240 PD 

1.5 

4.8 

FCT373C Propagation Delay 

50 

t373 PD 

1.5 

4.7 

FCT240C Propagation Delay 

50 

t240 PD 

1.5 

3.7 


Table 4.5. Timing Parameters of FCT Logic Devices 


4.4.1 Legend 


tRAMAA - 

RAM Access Time 

tRAMOE - 

RAM Output Enable Time 

tRAMHZ - 

RAM OutPut Low impedance to Output in 

High impedance 

tRAMLZ - 

RAM Output in High impedance to output in 
Low impedance 

tRAMHD - 

RAM Data Hold Time 

tDS- 

R3001 Data Setup Time 

tsys- 

Phase Difference between Clk2xSys and 
Clk2xPhi 

trd- 

Phase Difference between Clk2xPhi and 
Clk2xSmp/Rd 

tsmp- 

Phase Difference between Clk2xPhi and 
Clk2xSmp/Rd 

tcyc- 

Cycle time of the R3001 

t240PD - 

Propagation delay from Clk to Output of 
FCT240A 


5.0 USING x16 LATCHED RAMS AS CACHE 
FOR THE R3001 ON A SURFACE MOUNT 
DESIGN 

5.1 Assumptions for Surface Mount Design Layout Using 
x16 Latched RAMs as Cache for the R3001 

In this chapter, the RAM timings are calculated for a 4K X 
16 IDT71586 which have the latches built in. For the static 
RAMs with latches built in, the address access times tRAMAA, 
and the address setup to end of write tRAMAW will change from 
those of a regular static RAM. The propagation delay due to 
the latches is eliminated increasing the access time and the 
address setup to end of write by about 5 ns. In addition the 
board layout is different because the distances from the CPU 
to the RAM is reduced. This decreases the derating factors by 
a finite amount. This chapter calculates the derating factors 
for an R3001 cache design using the IDT71586 as cache. 
These are the following assumptions: 

1) The trace has a capacitance of 2 pF/inch 

2) The speed of light is 2 ns/foot in epoxy. 

3) The R3001 speeds are specified with a loading of 
25pF. For every additional 25pF, there is a delay of 

1 ns. Note that the cache control signals are specified 
with a 50pF load and derate 1 ns/25pF after that. 

4) The distances between the R3001 and the latches 
are approximately 5 inches. 

5) The distances between the R3001 and the RAMs are 
approximately 2 inches each. 

6) In all of the assumptions, it is assumed that a surface 
mount package is used. The input capacitance of the 
RAMs is a typical value (7 pF) for a PLCC package. 
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The instruction and data caches are designed using the 
IDT71586 latched SRAMs. The system memory is assumed 
tobe 256MB. Dataparityonthe R3001 isdisabled. The upper 
four tag bits are also not compared and their comparision is 
turned off at reset time using pull down resistors on the Tag 
(31 :28) bits. Therefore the cache data format is 48 bits wide. 
The instruction and data caches are built using six IDT71 586 
- three for instruction cache and three for the data cache. 

5.2 Derating Calculations Using IDT71586 as Cache 
RAMS 

The derating factors for the IDT71586 cache RAMs follow 
the same methodology as explained in Chapter 4. The cache 
size is 4K words for instruction and 4K words for data. The 
latches are eliminated. The derating factors for the address 
and data bus are calculated. 



Figure 5.1. 71586 Used as Cache RAMs for the R3001 
Cache Size = 16KB 


Figure 5.1 shows a cache system for the R3001 with the 
latched RAMs i.e., IDT71 586 as the cache. There are a total 
of 6 such devices required for 16KB each of instruction and 
data cache. 


D-Cache 

Tag 


R3000 


l-Cache 

Tag 


R3010 



( 


Figure 5.2. Surface Mount Board Layout of an R3001 System 
Using IDT71586 as Cache and 
Approximate Distances Between Devices 


Figure 5.2 shows an example layout of an R3001 surface 
mount design board using latched RAMs (IDT71586) as 
cache for the R3001 system. The distance between the 
R3001 data pins and the caches is about 2 inches. The total 
trace length for the address bus and the data bus is about 4 
inches each. 

5.2.1 Address Bus Derating Calculations 

Each AdrLo bus is connected to eight latched RAMs i.e., 
the IDT71586 and the address latch for main memory writes 
and reads. (Figure 5.3) 



Figure 5.3. Number of Devices Address Bus is Connected To 


Trace length from the CPU to the address latch (for main 
memory) = 4 inches (5.1) 

Capacitance of the trace = Ctrace = 4x2 pF/inch = 8 pF (5.2) 
Input capacitance of the 373 latch = 10 pF (5.3) 

Total input capacitance due to 6 RAM devices = 6 x 7 = 42 
pF (5.4) 

Total capacitance due to the load = 8 + 42 + 10 = 60 pF (5.5) 
The rated R3001 load = 25 pF (5.6) 

Extra loading on the R3001 = 60 - 25 = 35 pF (5.7) 

The delay can be calculated as follows. 

For every extra 25 pF of load, there is a delay of 1 ns (5.8) 

From Eq. 5.7 and Eq. 5.8, delay due to the capacitive load 
= 35 / 25 = 1 .4 ns (5.9) 

The speed of light = 2 ns/foot (5.10) 

For a maximum path length of 3", delay = 3712” x 2 = 0.5 
ns (5.11) 

From Eq. 5.9 and Eq. 5.1 1 , 

Total propagation delay for the address bus = 1 .4 + 0.5 = 2 
ns (5.12) 

From the above calculations, it is seen that the derating on the 
address bus is 2 ns. 

5.2.2 Data Bus Derating Calculations 

From Figure 5.4, it is seen that the data bus is connected 
to the floating point unit (R301 0), two 71 586 devices, one read 
register (FCT374A), and one write register (FCT823B). As in 
the previous chapter where we considered a 16 K x 4 static 
RAM, we have to calculate the deratings for two cases: i) for 
an instruction fetch, and ii) for a data store. 
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Bit DO 



Figure 5.4. Devices Data Bus of the R3001 Is Connected to 

5.2.2.1 Data Store (R3001 outputs data) 

Each data bit is connected to two RAM devices — one tor 
instruction and one for data. 

The path length of the data bus = 4 inches. (5.13) 

Trace capacitance of the data bus = 4x2 pF/inch = 8pF 

(5.14) 

Capacitive loading on the data bus due to the different 
devices = 2 X CRAMin + CR3010in + C374out + C823in = 2x7 


+ 10 + 12 + 10 = 46 pF (5.15) 

Total capacitive load = 

Cdevices + Ctrace = 46 + 8 = 54 pF (5.1 6) 

Propagation delay due to speed of light = 

4712" x 2 = 0.6 ns (5.17) 

Delay due to capacitive load = (54 - 25) /25 = 1.16 ns (5.18) 
Total delay = 1.16 + 0.7 = 1.8 ns « 2 ns. (5.19) 

5.2.2.2 Load Data into R3001(RAM outputs data) 

Since the trace length is the same, the trace capacitance 
Ctrace = 8 pF. (5.20) 

Capacitive load = CR3001in + CR3010in + C71586in + C374in + 
C823out (5.21) 

Cdevices = 10 + 10 + 7 + 12 + 10 = 49 pF (5.22) 

Ctotal = 49 + 8 = 57 pF (5.23) 

The RAM rated drive = 30 pF (5.24) 

Propagation delay due to extra capacitive loading = (57 - 
30) / 25 = 1 .08 ns (5.25) 

Propagation delay due to path length = 0.8 ns (5.26) 

Total propagation delay = 1 .08 + 0.8 = 2 ns (5.27) 


5.2.3 Read and Write Control Derating Calculations 

The effect of the capacitance on the control signals from the 
R3001 processor to the caches and the memory interface is 
cons idere d here. T he co ntrol signals on the R3001 are the 
IRd, DRd, IWr, and DWr which control the instruction cache 
read, data cache read, instruction cache write, and data cache 


write respectively. The read and write control signals are 
connected to the output enable (OE), and write enable (WE) of 
the instruction and data cache respectively. Two control 
signals each are provided for the read and write operations of 
each of the caches. Assuming the use of a4Kx 16IDT71586 
static RAM, each control signal is connected to 3 such static 
RAMs. 

Number of devices (SRAM) connected to each control line 


= 3 (5.28) 

Input capacitance of each device (SRAM) = 5 pF (5.29) 
Total load capacitance = 3 x 5 = 15 pF (5.30) 

Path length = 4" (5.31) 

Trace Capacitance = 4x2 pF/in = 8 pF (5.32) 

Total capacitance = 15 + 8 = 23 pF (5.33) 


There is no extra capacitive loading here as the rated 
R3001 load is 50 pF. 

Propagation delay due to the trace length = 0.8 ns (5.34) 
Total propagation delay = 0.8 = 1 ns (5.35) 

5.3 Timing Equations for Cache Design 

This section deals with the timing equations that enable us 
to determine the critical timing requirements of the static RAM 
that will be used as cache. These equations are based on the 
use of static RAMs without built-in latches as cache RAMs. 
The superscript ‘d’ in the following equations denote the 
deratings to betaken into account. The static RAM chosen for 
illustration here is a 4K x 1 6 IDT71 586. The board is assumed 
to be surface mount for all speeds of the R3001 . The deratings 
for the surface mount board is 2 ns. The deratings were 
derived from certain assumptions. The explanation and the 
methodology used is explained in the previous sections. In the 
following, a generalized equation is given followed by the 
timing requirementsfordifferentfrequencies of the R3001. All 
calculations are based on the R3001 specifications forthe four 
speed versions (16, 20, 25, and 33 MHz), which are found in 
the IDT data sheets. 

(1) Internal Sample to Phase Delay 

This is the time that the processor needs to sample the 
incoming data. Typically, for the R3001 , tsmp > 5 

(2) RAM Address Access Time 

This equation is used to determine the Address Access 
time parameter requirements of the static RAM. From 
the timing diagram of Figure 5.4, it is easily calculated. 
The total cycle time for a 33 MHz R3001 is 30 ns. If the 
processor’s sample time requirement is met, the time 
remaining in the cycle is 24 ns in which the data has to 
be presented to the processor. The processor requires 
a data setup time of 4 ns. The derating factors due to 
thecapacitance and the trace length have also to be 
taken into account. Using all these factors, the equation 
is, 
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tRAMAA < tcyc - tsmp - tDS - tAdrLod - tRAMAAd 
16 MHz R3001 : tRAMAA < 60 - 10 - 9 -3-3 
tRAMAA < 35 

20 MHz R3001 : tRAMAA <50-8-8-2-2 
tRAMAA < 30 

25 MHz R3001 : tRAMAA <40-6-6-2-2 
tRAMAA < 24 

33 MHz R3001 : tRAMAA <30-4.5-4.5-2-2 
tRAMAA < 17 

(3) Cache Enable to Sample 

This equation is used to determine the output enable 
requirements of the cache RAM and should meet the 
processor’s setup specification. The output enable time 
for the latched RAM is specified by the manufacturer 
and tested for a voltage change of 200 mV (1 .732 V to 
1 .532 V for IDT RAMs). For a system the voltage falls 
from a level of 3.3 V to 1 .5V and the added fall time must 
be considered when specifying the RAM tOE parameter. 
This fall time is approximately an additional nanos- 
econd. Therefore the RAM tOE parameter should be 
one nanosecond lower than the calculated numbers 
below. 

tOES < tcyc/2 - tRD d - tDS - tsys-smp + tsys-rd - tOES d 


16 MHz R3001: tOES <30-2-9-10 + 10-3 
tOES < 1 6 

20 MHz R3001 : tOES <25-1-8-8 + 8- 2 


(4) Minimum Read Pulse Width 

This timing requirement guarantees that the read pulse 
width generated by the processor is at least as long as 
the cache RAM output enable time. 

tOES < tcyc/2 - tsys-rd - tOES d 

16 MHz R3001 : tOES <30-10-3 

tOES < 1 7 

20 MHz R3001 : tOES <25-8-2 
tOES < 1 5 

25 MHz R3001 : tOES <20-6-2 
tOES < 1 2 

33 MHz R3001 : tOES <15-4.5-2 
tOES < 8.5 

(5) Read-Write 1-Cache Data Bus Contention 

This timing requirement ensures that the RAM output is 
tri-stated soon enough after the instruction read signal 
goes high. In the worst case, when the processor 
performs a store operation, no data contention oc- 
curs. 

tRAMHZ < tsys - tRd d + tDEn 
16 MHz R3001 : tRAMHZ < 16 - 2 + (-2.5) 
tRAMHZ < 1 1 .5 

20 MHz R3001 : tRAMHZ <14-1 + (-2) 
tRAMHZ < 11 


tOES < 14 

25 MHz R3001 : tOES <20-1-6-6 + 6- 2 
tOES < 11 

33 MHz R3001 : tOES <15-1-4-4.5 + 4.5-2 
tOES < 8 


25 MHz R3001 : tRAMHZ <12-1 +(-1.5) 
tRAMHZ < 9.5 

33 MHz R3001 : tRAMHZ < 9 - 1 + (-1 ) 


tRAMHZ < 7 

(6) Processor Data-Setup to End of Write 

This enables the designer to determine whether the 
cache RAMs have adequate data setup time when the 
processor does a store operation. In the equation, the 
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minimum derating is used on the write line i.e., twr d 
because that is the worst case assumption. 

tRAMDS < tcyc/2 - tsys-smp - tDVal - tDVal* - tWr d 

16 MHz R3001 : tRAMDS < 30 - 10 - 3 - 3 - (-2) 

tRAMDS < 16 

20 MHz R3001: tRAMDS < 25 - 8 - 3 - 2 - (-1) 
tRAMDS < 13 

25 MHz R3001 : tRAMDS < 20 - 6 - 2 - 2 - (-1) 
tRAMDS < 1 1 

33 MHz R3001 : tRAMDS < 15 - 4.5 - 2 - 2 - (-1) 
tRAMDS < 7.5 

(7) Data Hold from End of Write 

This parameter requirement guarantees that the data 
hold from end of write of the cache RAM is met when 
the processor or the read buffer is writing to the RAMs. 

tRAMHD < tRAMLZ 

16 MHZ R3001 : tRAMHD < 2 

20 MHZ R3001 ; tRAMHD < 2 

25 MHz R3001 : tRAMHD < 2 

33 MHz R3001 : tRAMHD < 2 

(8) Data Setup to SysClk 

This timing parameter ensures that the setup time into 
an external register (for the main memory interface) is 
sufficient enough for the case when the processor is 
doing a store. The data is clocked in the register on the 
rising edge of the buffered SysOut (through an inverting 
FCT240A). In this equation, tsys(min) d is used to insure 
worst case calculations. 

tsetupSys < tcyc/2 - tsys - tDVal - tDVal d + tsys d + t240PDmin 
16 MHz R3001: tSetupSys < 30-16-3-3 + 2 + 1.5 
tSetupSys < 1 1 .5 

20 MHz R3001 : tsetupSys < 25 - 1 2 - 3 - 2 + 1 + 1 .5 
tSetupSys < 1 0.5 


APPLICATION NOTE-77 

25 MHz R3001 : tsetupSys < 20- 12-2-2 + 1 +1.5 
tSetupSys < 6.5 

33 MHz R3001 : tsetupSys <15-9-2-2 + 1+1. 5 
tSetupSys <4.5 

(9) Data Hold from SysClk 

This timing parameter is to guarantee that the hold time 
specification for an external register is met on a proces- 
sor store. In this equation the minimum value of tRD d is 
taken to insure worst case numbers. 

tHoldSys < tsys-rd - tsys d - t240PDmax + tRAMLZ + tRd* 

1 6 MHz R3001 : tHoldSys < 6 - 2 - 4.8 + 2 + 1 
tHoldSys < 2.2 

20 MHz R3001 : tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys < 3.2 

25 MHz R3001 : tHoldSys < 6 - 1 - 4.8 + 2 + 1 
tHoldSys < 3.2 

33 MHz R3001 : tHoldSys < 4.5 - 1 - 4.8 + 2 + 1 
tHoldSys < 1 .9 

(10) Address Setup to End of Write: 

This equation enables us to determine the timing re- 
quirement for the RAM so that the address set up time 
is sufficient before the trailing edge of the write pulse. 

tRAMAW < tcyc - tsmp-sys - tAdrLo d + tWr* 

16 MHz R3001 : tRAMAW < 60-10-3+2 

tRAMAW < 49 

20 MHz R3001 : tRAMAW <50-8-2 + 1 
tRAMAW < 41 

25 MHz R3001 : tRAMAW <40-6-2 + 1 
tRAMAW < 33 

33 MHz R3001 : tRAMAW <30-4.5-2 + 1 
tRAMAW < 24.5 
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(11) Write Hold Pulse Width: 

This requirement guarantees that the cache RAMs 
minimum write pulse width specification is met. 

tRAMPW < tcyc/2 - tWrDly 

16 MHz R3001:tRAMPW<30-5 

tRAMPW < 25 

20 MHz R3001 : tRAMPW <25-4 

tRAMPW < 21 

25 MHz R3001 : tRAMPW <20-3 

tRAMPW < 17 

33 MHz R3001 : tRAMPW <15-2 

tRAMPW < 13 

From the above calculations and Figure 5.3, it can be 
seen that the data setup to the processor is met. The 
output enable of the RAM which is controlled by IRd 
goes high and the RAM output starts to go tri-state. 
From the figure, the reader may correctly question 
whether the hold time requirements of the R3001 are 
met. It is indeed met by the capacitance on the bus and 
also due to the fact that CMOS devices are being used . 
The technical note entitled “Meeting Bus Hold for the 
R3001" gives a more detailed explanation. 


From Figure 5.5, it is clearly seen that the address 
setup and hold time for the latched RAMs are met by 
using ICIk to capture the instruction address. Figure 
5.5 is to illustrate the timings for a 25 MHz IDT71586 
latched RAM. Similartiming diagrams can be drawn to 
verify the setup and hold times for R3001 operating at 
different frequencies. 

5.3.1 Legend 

tRAMAA - RAM Access Time 

tRAMOE - RAM Output Enable Time 

tRAMHZ - RAM OutPut Low impedance to Output in 
High impedance 

tRAMLZ - RAM Output in High impedance to output in 
Low impedance 

tRAMHD - RAM Data Hold Time 

tDS - R3001 Data Setup Time 

tsys - Phase Difference between Clk2xSys and 
Clk2xPhi 

trd - Phase Difference between Clk2xPhi and 
Clk2xRd 

tsmp- Phase Difference between Clk2xPhi and 
Clk2xSmp 

tcyc - Cycle time of the R3001 
tsmp-rd = tsmp - trd 

t240PD - Propagation delay from Clk to Output of 
FCT240A 


phase 2 phase 1 



Figure 5.5. Address Setup and Hold Timing for a Latched RAM (25 MHz R3001) 



9.14 


23 




R3001 SPECIFICATION & CACHE RAM TIMINGS 

MICROPROCESSOR INTERFACE GUIDE APPLICATION NOTE-77 


Parameter 

Description 

16 MHz 

20 MHz 

25 MHz “j 

33 MHz 




Min. Max. 

tRC 

Read cycle 

35 

30 

25 

15 

tCH 

ALEN high 

10 

10 

10 

8 

td 

ALEN low 

10 

10 

10 

8 

tAS 

Adr Latch set-up 

5 

5 

5 

4 

tAH 

Adr Latch Hold 

5 

5 

5 

4 

tAA 

Address Access 

35 

30 

24 

17 

tACE 

Chip Enable access 

35 

30 

25 

17 

tOES 

Output enable 

16 

14 

11 

8 

tCLZ 

CE to out in LZ 

3 

3 

3 

3 

toLZ 

OE to out in LZ 

2 

2 

2 

2 

tcHZ 

CE to out in HZ 

25 

22 

20 

15 

tOHZ 

OE to out in HZ 

11 

11 

9 

7 

tOH 

Output Hold from 
address in change 

3 

3 

3 

3 


Table 5.1. Read Cycle Timings for an IDT Static RAM with Latches 


Parameter 

Description 

16 MHz 

20 MHz 

25 MHz 

33 MHz 


Min. Max. 


Min. Max. 

twc 

Write cycle 

35 

30 

25 

15 

tCH 

ALEN high 

10 

10 

10 

8 

td 

ALEN low 

10 

10 

10 

8 

tAS 

Adr Latch set-up 

5 

5 

5 

4 

tAH 

Adr Latch Hold 

5 

5 

5 

4 , 

tAW 

Address to end of write 

35 

30 

25 

15 

tASW 

Address set-up 

0 

0 

0 

0 

twp 

Write pulse width 

25 

20 

17 

1 1 

tew 

CE to end of write 

25 

20 

20 

11 

tWR 

Write recovery 

0 

0 

0 

0 

tWHZ 

Write to out in HZ 

15 

15 

13 

8 

tDW 

Data setup 

16 

13 

11 

7 

tDH 

Data Hold 

0 

0 

0 

0 

tow 

Out active from end of 
write 

5 

5 

5 

5 


Table 5.2. Write Cycle Timings for an IDT Static RAM with Latches 
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INTRODUCTION 

The inherent flexibility of the IDT 79R3000 family architecture al- 
lows system designers to make a wide variety of system tradeoffs. 
Therefore, it is extremely important that effective development 
tools be available for each step of the design. This need starts 
when the initial architectural decisions and performance simula- 
tions are made and carries through to the final stages of hardware- 
software integration and system diagnostics development. 

This paper describes the System Programmer’s Package (SPP) 
for the IDT 79R3000, an effective set of tools available for each 
part of the design process. These tools allow designers to make 
tradeoffs in both hardware and software which result in systems 
with very cost-effective performance, and also speeds the overall 
development of the target application. 

PROCESSOR OVERVIEW 

The architecture of the IDT 79R3000 family is partitioned into a 
processor and a separate high-performance floating point unit. 
The processor, which can sustain 20 VAX MIPS performance, 
consists of two tightly-coupled processors implemented on a 


single ship. The first processor is a full 32-bit CPU utilizing RISC 
techniques to achieve a new standard of microprocessor perform- 
ance; the second processor is a system control coprocessor 
(CPO), containing a Translation Lookaside Buffer (TLB) and con- 
trol registers to support a 4 GByte virtual memory subsystem. Also 
integrated onto the processor chip is a dual-cache controller which 
controls separate direct-mapped instruction and data caches. 
Each cache can independently vary in size from 4K bytes through 
256K bytes, with independently selectable block refill sizes of 1 to 
32 words. The processor achieves 200 MBytes/ second cache 
bandwidth at 25 MHz. Figure 1 illustrates the interaction between 
the CPU, its coprocessors, and the memory subsystems. 

The full performance of the 79R3000 is achieved by the proper 
integration of software and hardware. The 79R3000 contains a 
highly efficient five-stage pipeline, with each stage of the pipeline 
controlling a different CPU resource. Optimizing compiler technol- 
ogy serves to both reduce the number of instructions required to 
perform a given task, and also serves to efficiently schedule in- 
structions. This eliminates processor stalls which might arise if a 
hardware interlock is activated and eliminates NOP instructions 
during the latency cycles of LOAD or BRANCH operations. 



Figure 1. 79R3000 Based System 


c 1989 Integrated Device Technology, Inc. 
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ARCHITECTURAL DECISION PHASE 

In order to determine the system configuration which yields the 
best price/performance tradeoff for a given application, the system 
designer has to make a number of decisions. For example, the de- 
signer needs to select the cache configuration which best supports 
the application requirements. The 79R3000 allows the designerto 
choose various cache depths from 4K bytes to 256K bytes, and to 
select an appropriate block refill size, for both the instruction and 
data caches. 

The optimal cache size and block size is a function of the locality 
of the typical programs running on the system, as well as the CPU 
sub-system’s latency to main memory when a cache miss occurs. 
In general, the longer the latency to main memory the longer the 
CPU will be stalled for a cache miss and thus the more severe the 
penalty. This means that in order to overcome the effects of a long 
memory latency, a large cache and a fairly large block size are 
needed to support high performance. Systems with a relatively low 
latency to main memory may be able to tolerate a smaller, less ex- 
pensive cache subsystem and a smaller block size. 
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Figure 2. Using Cache-2000 to Determine 
Optimum System Configuration 

Before going to the expense of drawing schematics or building 
boards, the system designer can use a set of programs contained 
in the SPP which simulates various cache sizes and refill sizes. 
Since the typical bus latency of the target system is included during 
the simulations, a relatively accurate model of anticipated perform- 
ance is available. This capability is contained in the Cache-2000 
module as part of the SPP; provided in source form, this package 
allows the system designerto describe the overall system and then 
benchmark typical target software (as shown in Figure 2). In this 
manner, a decision about the allowable tradeoffs in cost and per- 
formance can be made before the detailed design is begun. 

SOFTWARE DEVELOPMENT PHASE 

Software development is increasingly a bottleneck in system de- 
velopment. The SPP contains a number of utilities which facilitate 
the parallel development of software with hardware, allowing much 
of the software can be developed long before the target hardware 
is available. 

Key to this is the stand-alone software simulator package, Sa- 
ble, which models the CPU, FPA, and the entire memory hierarchy 
of TLB, Cache, main memory, disk and system console. Sable 
simulates the execution of 79R3000 instructions in an environment 
which mimics the target system. Further customization is possible, 
as the simulation package is provided in source form. 


While Sable models things such as disks simply as a source or 
sink for data files without modeling the physical performance of a 
disk (latency, etc.), Sable does provide sophisticated models of the 
TLB and cache. Sable keeps track of the contents of the TLB and 
of each cache entry after every cycle. This allows debugging of 
kernels, diagnostics and other programs which might require 
knowledge of the privileged, supervisor state as well as facilitates 
debugging of user programs which may execute on top of an oper- 
ating system. 

Thus, by using the Sable architecture simulator, the software for 
the target system can be developed concurrently with the hard- 
ware itself. It is worth noting that MIPS Computer Systems uses 
Sable to debug its operating systems for new computers, and typi- 
cally has the software working months in advance of the target 
hardware. 

Other utilities are provided to minimize the software develop- 
ment effort. The Stand-Alone I/O library, SAIO, is designed to per- 
form the functions of the stdio standard ”G" library. The SAIO li- 
brary contains Unix system-like routines that access disks, Ether- 
net, tapes, and UART devices and also includes routines to per- 
form fault handling and cache flushing. The stand-alone compiler 
system uses the SAIO library to produce code that will execute in 
the target stand-alone environment. The entire stand-alone ap- 
plications program is then constructed ”on-top” of the modules 
provided by the SPP, as shown in Figure 3. 
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Figure 3. Using SPP to Develop Stand-Alone Applications 
on the Host System 

Code can be developed in assembly language, ”C”, Pascal, For- 
tran, or even PL/1 or Cobol; the software development tools work 
equally well with any of MIPS standard languages. MIPS simula- 
tion debugger, sdbx, provides source level debugging across the 
entire language suite, tracing every machine action back to the 
original source statement. Sdbx provides the ability to examine 
and modify the state of the machine as well as standard debug fa- 
cilities such as single-step or breakpoint. 

Finally, another utility that comes standard with the MIPS RISC/ 
os UNIX operating system, called PIXIE, provides extensive profil- 
ing of the target code. PIXIE dynamically profiles the execution of 
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the target software, so that the performance of the application can 
be tuned and so that all of the modules can be fully tested. PIXIE 
works on the binary image of the program, and thus does not re- 
quire modification to the source code in order to be used. 

HARDWARE/SOFTWARE INTEGRATION 

The SPP toolkit also benefits the process of integrating software 
onto the target hardware. The SPP provide utilities which connect 
the target to the host development machine for final debugging, 
and also provides a number of software modules designed to be 
incorporated into the target to facilitate integration. 

The SPP includes a PROM monitor which provides system diag- 
nostics and initialization of the machine. Provided in source form, 
the monitor is designed to initialize the cache, reset the machine 
state, and transfer control either to a shell program or the applica- 
tions program. The monitor is responsible for storing communica- 
tions parameters for the console port and maintaining other sys- 
tem information. Utilities are provided to facilitate the compilation 
of the monitor program and to help burn it into PROMs. The PROM 
monitor allows basic commands to examine and modify system 
memory, and provides primitive debug support which is typically 
retained in the customer end product. 

In addition to monitor functions, the PROM monitor also contains 
a set of power-on diagnostics which exercises the basic function- 
ality of the CPU, FPA and memory. These diagnostics are also 
often incorporated into the customer end product. Once the target 
system can correctly execute the diagnostic suite, the application 
is known to communicate effectively with the cache and the basic 
operation of the CPU subsystem is confirmed. 
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The Monitor and Stand-Alone Shell, SASH, extends the func- 
tionality of the simple PROM monitor. The monitor functions are 
partitioned between the PROM Monitor and SASH so that basic 
functions provided by the PROM monitor can be kept in a small 
PROM, while the extended SASH functions can then be executed 
out of RAM. 

The ’dbgmon’ debug monitor is designed to be incorporated into 
the target machine. It provides hooks which allow the host to re- 
motely debug the target machine, and works with the Mips host 
system debugger, pdbx, which is a source level symbolic debug- 
ger configured to operate remotely. This duo performs remote de- 
bugging, including the ability to breakpoint, single step, and exam- 
ine the execution state of the target. Beyond disassembly, the 
debugger maps machine instructions to the source statements 
which generated them, and prints variables and addresses 
symbolically. 

Finally, in order to debug the target through the host develop- 
ment vehicle, communications software is also provided. Two 
RS232 ports in the target system aid development; one for the tar- 
get systems’ console monitor and one for the remote debug ac- 
cess to the target from the host. In addition to RS232 capability, 
drivers are also provided for Ethernet. Once the initial debug of the 
program is completed, Ethernet can be used to greatly speed the 
transfer of programs and data from the host. A Bootfile Server 
Driver is provided on the host to download bootable images to the 
target across the Ethernet. The SPP provides driver routines in 
source form, so that if necessary the drivers can be modified. Fig- 
ure 4 illustrates the use of the SPP in debugging the stand-alone 
application on the target system. 

HOST DEVELOPMENT SYSTEMS 

The SPP has been designed to run on a variety of development 
hosts, provided that the host utilizes the R3000 processor. By run- 
ning on a native mode platform, using a toolkit that was developed 
by the same people that developed the processor itself, there is no 
danger of errors in the interpretation of the workings of the 
processor. 
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Figure 6. The Systems Programmer’s Package for the IDT 79R3000 


Figure 4. Using the SPP During Hardware/Software 
Integration 
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Platforms for the SPP include MIPS’ M Series of RISC comput- 
ers, which are powerful (1 0 to 20 VAX MIPS) multi-user UNIX envi- 
ronments, and also includes the 7RS201 Macintosh-ll develop- 
ment card available from IDT. The choice of host development 
machine is primarily determined by the number of developers of 
the application. Figure 5 illustrates the range of SPP hosts 
available. 

CONCLUSIONS 

The IDT 79R3000 is a very powerful microprocessor based upon 
RISC principles. The processor architecture is supported by a very 
powerful toolset, the SPP, which supports product development 
during all phases of the design process. 

During the initial architectural phase, the SPP provides tools 
which enable the system designer to make appropriate tradeoffs in 
cache design to achieve very cost effective performance for the 
end application. 

Software development can then proceed concurrently with hard- 
ware development. The SPP includes language and library tools 
to facilitate development, as well as a powerful symbolic debugger 
for the software debug effort. An entire simulation environment 
simulates the processor, memory, and I/O subsystems. It is used 
to fully simulate the target system before it is even fully designed, 
and allows the target operating system and software to be de- 
bugged on an existing, functional platform. 

Finally, the process of integrating software onto the target hard- 
ware is facilitated by modules included in the SPP. Source code 


for communications drivers, diagnostics, and a debugger and sys- 
tem monitor allow very powerful debug support as the software is 
brought up on the target hardware. 

Altogether, the SPP enables applications to be developed for the 
79R3000 in a very timely fashion, and allows the system designers 
to concentrate on developing the target system rather than on de- 
veloping tools. The SPP provides the user with total control over 
the development process. Its power has been demonstrated at 
MIPS Computer Systems, where it is used to develop operating 
systems for new computers. 
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Figure 6. The Systems Programmer’s Package for the IDT 
79R3000 
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INTRODUCTION 

Today’s high-speed RISC microprocessors bring new levels of 
performance to todays’ applications. This performance is a blend 
of the raw system speed and of the architecture of the processor. 
However, these processors require a different design methodol- 
ogy than the familiar techniques applicable to lower speed CISC- 
type microprocessors. Todays’ processors feature tools rich in 
software content, appropriate to the software-intensive nature of 
RISC. 

The IDT 79R3000 RISC Microprocessor family is richly sup- 
ported by a wide variety of development tools, benefiting all stages 
of the development process. These tools allow designers to maxi- 
mize performance, minimize cost, and significantly reduce time to 
market. Additionally, the tools are flexible enough to be well suited 
to the requirements of either reprogrammable or embedded type 
applications. 


ARCHITECTURE OVERVIEW 

The IDT 79R3000 is a high-performance microprocessor using 
RISC techniques to achieve 27 MIPS performance in a 33MHz 
system. The 79R3000 architecture was developed by MIPS Com- 
puter Systems as an evolution of work originally begun at Stanford 
University, and is a second generation implementation of the MIPS 
instruction set architecture. 

The 79R3000 achieves this high-performance through the com- 
bination of low average cycles per instruction (typical of RISC proc- 
essors) and high instruction and data bandwidth. The 79R3000 
consists of two tightly-coupled processors on a single chip. The 
32-bit integer RISC CPU is complemented by CPO, the on-chip 
system control coprocessor containing an MMU with 64-entry, 
fully associative TLB, processor control and status registers, and 
an on-chip cache controller featuring 267MBytes/second of proc- 
essor bandwidth using industry standard static RAMs. 



Figure 1. Typical 79R3000 Based System 
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The architecture features a separate floating point accelerator, 
the 79R3010, which executes at 9.3 single precision Unpack 
MFLOPS (4.3 MFIops double precision). Also frequently used is 
the 79R3020 write buffer, which allows the processor to execute 
stores to main memory at the processor cycle rate, rather than the 
main memory rate. Figure 1 illustrates a typical 79R3000 based- 
system. 

The full performance of the 79R3000 family is a result of bal- 
anced integration between software and hardware. The integer 
CPU contains a five-stage pipeline, therefore executing five in- 
structions concurrently and reducing the average time per instruc- 
tion. Optimizing compiler technology serves to both reduce the 
number of dynamic instructions required to execute a given task, 
and also to insure that the various resources of the CPU are fully 
utilized. Examples of compiler optimization include scheduling in- 
structions to eliminate latency effects in branch or load instruc- 
tions, and also scheduling resources to minimize the occurance of 
hardware interlocks which might arise if an access to a busy re- 
source is attempted. 

ON-CHIP CACHE CONTROL 

The 79R3000 contains an on-chip cache controller which con- 
trols separate Instruction and Data Caches. Each cache can vary 


in depth from 4k Bytes through 256kBytes. Additionally, each 
cache can have independent cache refill block sizes of 1 through 
32 words (block refill refers to how much data is retrieved from 
main memory when processing a cache miss; block refill relies on 
the principle of locality of reference to improve net processor per- 
formance by amortizing the expense of going to slower main mem- 
ory over a number of instruction or data elements likely to be 
needed). 

The on-chip cache controller implements separate, direct- 
mapped Instruction and Data caches using industry standard 
static RAM devices such as the IDT 7198 (16k x 4) or IDT 71586 
(4k x 1 6 with integrated address latch), using a single address and 
single data bus. External latches capture the address for the 
cache access, while the 79R3000 provides signals to directly con- 
trol the address latching, RAM output enable, and RAM write con- 
trol for each cache. The tag comparison occurs on-chip, simulta- 
neous with the data access, minimizing the amount of time neces- 
sary for the cache control function. Thus, it is very simple to imple- 
ment a highly— efficient cache for the 79R3000 while minimizing 
the amount of logic needed to implement the cache control. Figure 
2 illustrates the design of 64kBytes each of Instruction and Data 
Cache, a configuration common in workstation applications. 
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Figure 2. 64kBytes of instruction and Data Cache 
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ASYNCHRONOUS MEMORY INTERFACE 

The 79R3000 also incorporates a simple, flexible interface to 
main memory resources, including RAM, PROM, and I/O. This in- 
terface is supported through the use of the Asynchronous Memory 
Interface, which is used whenever uncached data is required (e.g. 
cache miss processing, uncacheable memory, I/O devices, etc.). 

The asynchronous interface consists of control signals for read- 
ing and writing main memory. The processor asserts its request 
signal (e.g. MemRd, which indicates a main memory read), and 
main memory throttles the processor with the appropriate control 
signal (e.g. RdBusy, which says that the required data is not yet 
available). Further information about the type of transaction is 
available from the Access Type bus, which indicates the size and 
cause of the data access. The processor automatically updates 
the contents of its caches when main memory traffic was initiated 
by a cache miss. 

The 79R3000 can be used with a wide variety of memory inter- 
faces. In compute server type systems, the asynchronous inter- 
face is typically used as part of a backplane bus arbiter circuit. The 
memory required by the processor then typically takes many cy- 
cles before the first transaction can be completed, while immedi- 
ately subsequent transactions take considerably fewer cycles (the 
79R3000 block refill mechanism takes advantage of this fact). In 
embedded systems, the processor typically has much more direct 
control of its main memory resources, and these signals are used 
with a simple state machine to facilitate transfer activity between 
the main memory bus and the CPU-cache bus. Thus, the 
79R3000 asynchronous memory interface is flexible enough to 
satisfy a wide variety of applications. 

With all of this flexibility, it is important that the system designer 
have a rich toolset available to assess the implications of certain 
design decisions. This toolset must account for the application 

Instruction #1 


constraints in terms of cost, size and performance. The IDT 
79R3000 is supported by tools which allow system designers to 
evaluate their proposed system in advance of committing to PC 
board, and to verify that the system design goals are achieved. 

PERFORMANCE CONSIDERATIONS 

This architecture results in the highest standard of microproces- 
sor performance. In developing an application for the 79R3000, it 
is important to understand the fundamental reasons for the proces- 
sor's high standard of performance, and to understand the implica- 
tion of design decisions in terms of performance. In this way, the 
cost of the system can be optimized for the performance level re- 
quired by the application. 

The performance of the 79R3000 is a result of its low average 
clocks per instruction, and its high memory bandwidth. The pipe- 
line stages allow multiple instructions to be in various stages of 
processing simultaneously, as shown in figure 3. (Compiler tech- 
nology serves to insure full use of this parallelism.) 

The instruction cache allows a new instruction to be initiated on 
every processor clock cycle, and the low average instruction la- 
tency means that instructions are completed at close to one cycle 
per instruction. The purpose of the instruction cache is to provide 
the majority of the instructions at the processor clock rate. 

Instruction bandwidth is complemented by data bandwidth. The 
purpose of the system write buffers (such as the IDT 79R3020) is 
to allow the processor to finish with a data write in one clock cycle, 
even if the system takes multiple clock cycles to retire the data. 
The data cache provides for very fast data load operations, and is 
capable of supplying a data operand in every clock cycle (the 
79R3000 can get both an instruction and data item in each clock 
cycle, resulting in 267MBytes/sec bandwidth at 33Mhz). 
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Performance in a given application is thus to a first approxima- 
tion a function of system speed and cache efficiency. That is, more 
cache hits imply that more information is given to the processor at 
the single cycle rate. Coupled with high frequency (short cycle 
time) operation, the 79R3000 achieves ultra-high performance. 
The 79R3000 includes features which further help to minimize the 
impact of cache miss processing. These features include the block 
refill capability, which allows multiple main memory words to be re- 
trieved when performing cache miss processing. Block refill takes 
advantage of the difference between memory latency (the time to 
the first operand), and memory bandwidth (the time between suc- 
cessive operands). The 79R3000 allows the designerto select the 
block refill appropriate to the application. Further, the 79R3000 
performs instruction streaming, which is the simultaneous fetch 
and execution of cached instructions, to result in substantial per- 
formance improvement when filling the cache with a new process 
or task. 

The amount of cache "sufficient” for a given application is a func- 
tion of a number of factors. These factors include the latency to 
main memory (how long on average it takes to receive an operand 
not in the cache), as well as the instruction mix (the reference and 
time locality of the code, the amount of data versus ALU opera- 
tions, etc.). The system designer is free to implement more or less 
cache, as appropriate for the system, depending on his cost and 
performance constraints. 

One tool available to help the system designer determine the ap- 
propriate amount of cache for a given application is the 
Cache-2000 cache simulator. With Cache-2000, the designer 
can "describe” a proposed system and evaluate the performance 
of representative software on that system. Cache-2000 does not 
require the software engineer to modify source code, but rather 
works from the executable image of the code using UNIX profiling 
tools to determine the number of cycles required to execute the 
program on the described system. Inputs to Cache-2000 include 
the cache sizes, processor frequency, main memory latency, write 
buffer depth, and block refill sizes selected. Cache-2000 provides 
a highly accurate simulation of the software performance on the 
system described, and gives the system designer a firm basis for 
making design trade-offs. 

A final consideration exists for real time systems, where a gen- 
eral purpose cache may not satisfy the predictability requirements 
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of the application. A number of techniques for the 79R3000 exist 
which guarantee predictable, high-performance in real time appli- 
cations. These techniques allow the system designerto use either 
hardware or software to lock time critical portions of the operating 
system into cache at all times, guaranteeing single cycle access to 
kernel instructions. Alternately, the system designer could take 
advantage of the on-chip cache controller to construct a memory 
system entirely with synchronous memory devices (e.g. SRAM), 
and achieve the highest levels of system performance and true 
predictability. 

Figure 4 illustrates alternative ways in which the 79R3000 flexi- 
bility and performance can be brought to embedded applications. 
The performance evaluation tools allow the system designer to 
conclude in advance whether a given system architecture is 
appropriate to the task . 

COST TRADEOFFS 

Once the appropriate system configuration is determined, the 
system designer can go about designing the lowest cost system 
achieving the desired level of performance. IDT has written appli- 
cations notes describing ways to reduce system cost while not af- 
fecting application performance. These techniques take advan- 
tage of the nature of various types of applications, and use the inte- 
gration of the 79R3000 to full advantage. By taking advantage of 
these features, the system designer can build "just as much” cache 
as is needed for the application. 

Specifically, the 79R3000 cache controller was designed to im- 
plement a line size of one word (that is, one tag, or main memory 
origin descriptor per 32-bit word in the cache). Further, the cache 
controller was designed to support a full 4 Gigabytes of cacheable 
main memory, and to implement caches between 4k bytes and 
256k Bytes in depth. In doing so, the on-chip cache controller will 
compare 20-bits of tag on each cache cycle. 

However, it is not required that all tag values come from ram ele- 
ments. In systems which implement caches larger than 4k Bytes, it 
is possible to provide a feedback path between the processor out- 
put cache address and the input low-order tags. This can be done 
with a single 74FCT244A buffer/driver, rather than RAM. 
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Figure 4. Typical 79R3000-based Systems 
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Similarly, many embedded applications require less than 4 
Gigabytes of cacheable memory (a number more typically associ- 
ated with reprogrammable applications). In these systems, the de- 
signer can "hard-wire” high-order tags, using resistors or buffers, 
and reduce the cost of the cache by reducing the cacheable mem- 
ory space. This still allows the system designer to access the en- 
tire 4 GigaByte address space for memory decoding, etc. 

Additionally, there is no absolute requirement that the cache line 
size be one word. The system designer could take advantage of 
the processor’s built in block refill capability to implement a larger 
line size. If the line size were increased to only four words, then 
one-fourth as many tags would be required as data elements, and 
the cost of the cache is further reduced. 

THE ROLE OF DEVELOPMENT TOOLS 

For the system designer to properly design the application, it is 
important that sufficient tools be in place. Many of these decisions 
will be determined by the software to be run on the application. 
The 79R3000 features a wide variety of software development 
tools, which allow the concurrent development of both hardware 
and software. These tools allow the software to be developed be- 
fore the target hardware is designed. Thus, the system designer 
works from accurate information regarding the nature of the soft- 
ware, its dynamic execution profile, and the amount of memory re- 
quired to support it. The software developer develops his applica- 
tion on the development host, and uses Sable, the architecture 
and instruction set simulator, to develop all of the application code 
prior to target system availability. 

The hardware designer can also draw on a variety of develop- 
ment and debug tools. In addition to the Cache-2000 program, 
there are a number of other tools used by the hardware designer. 


These include hardware models of the component family, such as 
those available from Mentor or Valid Logic. These models use ac- 
tual components to simulate the interaction of the CPU in the target 
system, while the system exists only in schematic form on the CAD 
platform. This reduces the risk that the first PC board has 
significant errors. 

When the system is being initially debugged, logic analyzer tools 
are used to trace the system activity. The 79R3000 is supported 
on a number of logic analyzers, including Gould, Tektronix, and HP 
family offerings. These analyzers disassemble the bus traffic, and 
are invaluable, non-intrusive aids during the debug process. 

Software tools are also used during system debug. The PROM 
monitor, available from IDT, includes useful debug aids such as 
breakpoint, memory examine and modify, diagnostics, etc. The 
PROM monitor is available in source form, allowing full customiza- 
tion to the requirements of the target system. Thus, the interaction 
of the target software and hardware can be examined and con- 
firmed during the debug stages. 

PRE-PACKAGED SOLUTIONS 

There are a number of other products available to minimize the 
amount of time required for RISC development. These include 
hardware and software products. 

For the hardware designer, IDT offers a family of RISC subsys- 
tems, integrating the CPU and Caches onto a single, small foot- 
print module such as the subsystem shown in figure 5. These 
modules are ideal for initial development and even for full produc- 
tion. The risk associated with high-speed system design is elimi- 
nated, and the customer can focus on his value added software 
and peripherals. 
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For the software designer, a rich toolset is also available. The 
Stand-alone I/O library interfaces between C language I/O calls 
and direct device drivers. The PROM monitor eliminates the need 
for the software developer to develop tools, and thus allows him to 
focus on the software for the application itself. Finally, a wide vari- 
ety of operating system support is available, including real time op- 
erating systems for either C or Ada applications. 

CONCLUSIONS 

The IDT79R3000 architecture is ideal for a wide variety of high- 
performance applications, ranging from embedded applications 
through real-time systems and including high-performance 
reprogrammable applications such as workstations and file 


servers. In addition to its inherent high-performance, the 
79R3000 architecture is flexible enough to allow the system de- 
signerto configure the system according to the application require- 
ments, thus achieving the desired performance at minimum cost. 

In order to bring the performance of the processor to the applica- 
tion system, it is important that a rich and robust toolset be avail- 
able at all stages of the system development cycle. The 79R3000 
provides tools for both the hardware and software teams to use 
during all phases of the development project, as shown in figure 6. 

The end result is the highest-performance processor available 
today, and a toolset which enables application developers to bring 
that performance to their systems at minimum cost and with mini- 
mal development effort. 
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Figure 6. System Development Phases and Support 
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ABSTRACT 

This paper discusses the architecture of the R3001 
RISController™, a derivative of MIPS Computer Systems 
R3000 microprocessor which IDT has developed to address 
the particular needs of embedded systems designers. This 
paper discusses the architectural features of the MIPS RISC 
architecture which make it well suited to embedded 
applications, and discusses the changes to the R3000 
implementation embodied in the R3001 . The paper also gives 
examples of how these changes help embedded system 
designers achieve the goals of their applications. 

INTRODUCTION 

The IDT R3000 microprocessor (the MIPS RISC Processor) 
has found widespread acceptance among re-programmable 
applications such as UNIX™ workstations and server systems. 
Less widely known, but of considerable significance, is the 
growing acceptance in the embedded marketplace in 
applications including real-time control systems, data 
communications, laserprintercontrollers, graphics terminals, 
and avionics controllers. 

This acceptance is testimony to the elegance of the 
instruction set, software, and the basic device implementation. 
In working with such a diversity of customers, however, IDT 
identified a set of significant changes to the R3000 device 
which make it even better suited to solving these types of 
control problems. 

While the R3000 is obviously a good device for many 
embedded applications, it became clear from our customers 
that it would be possible to implement incremental changes to 
the device to simplify and broaden its application in embedded 
systems. IDT has implemented a derivative of the R3000 
which addresses many of the issues the R3000 brought to 
embedded system designers. 

The R3001 is the solution. The design goals of the R3001 
were to: 


• Maintain FULL software compatibility with the R3000, at 
both the kernel and user levels, to maximize the wealth of 
software support (both development and applications) 
available for the MIPS RISC architecture. 

• Allow embedded system designers to realize the perform- 
ance of the R3000 at lower total system cost (fewer 
devices, less power, less board area, etc.). 

• Allow the system designer more options in the design and 
partitioning of the high-performance memory system. 

• Give the system designer full control of all aspects of 
system design; don’t make each system pay for the full set 
of worst case assumptions about systems dramatically 
different from his or her target application. 

• Recognize the needs of real-time deterministic systems, 
and provide solutions to the problems of general purpose 
caches in these applications. 

• Support systems which do not wish to implement “cache", 
such as real-time systems. 

• Support the use of complementary products designed for 
the R3000, such as the high-performance R301 0 Floating 
Point, cache RAMs, interface chips, etc. 

• Support systems which distribute the processing task 
among tightly-coupled heterogenous processing devices, 
such as systems with I/O processors working directly with 
the R3001 local memory. 

• Maintain same pin count as R3000 (which is pad limited; 
higher pin count would thus result in higher end-user 
device cost). Pin compatibility was not a constraint. 

• Achieve the same range of speeds asthe standard R3000. 

• Maintain the full performance of the R3000. 

This paper discusses the architecture of the R3001, and 

how it achieves the above goals. A few design examples are 

included to help clarify how these changes achieve the goals 

established for the project. 


MIPS is a trademark of MIPS Computer Systems, Inc. 

VAX is a trademark of Digital Equipment Corp. 

UNIX is a registered trademark of AT&T. 

RISControllem' is a trademark of Integrated Device Technology, Inc. 


©1990 Integrated Device Technology, Inc. 
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THE R3000 IN EMBEDDED SYSTEMS 

There are a number of reasons why the R3000 has been 

used in embedded systems. The primary reasons are: 

• The R3000 attains the highest levels of performance, 
based on its efficient architecture: 28 VAX Units of 
Performance at 33 MHz, sustained, in reai-world 
application. Other processor architectures require much 
higher frequencies to approximate the performance 
achieved by the R3000. 

• The R3010 Floating Point Acceleratorco-processorbrings 
excellent floating point performance to those systems that 
need it, such as military avionics control systems. 

• The compilers forthe R3000 achieve most of the efficiency 
of programming in assembly language, while offering the 
ease of development of high-level languages. 

• It is possible to use a single basic CPU subsystem design, 
and achieve various levels of performance by either 
varying cache depth, system speed, or memory interface 
(write buffer depth and block refill size), and by using 
hardware or software floating point. 

• The R3000 development environment provides powerful 


software developmenttools, including debugging support, 
system profiling, system performance projection, and target 
system software simulation. This allows system software 
to be developed quickly and in parallel with the target 
hardware, minimizing time-to-market. 

THE R3000 ARCHITECTURE 

The R3000 implements a strictly hierarchical view of 
memory, appropriate for minicomputer systems. In a typical 
R3000 system, a given level of memory is a high-speed cache 
of the larger, slower memory below in the hierarchy, as shown 
in Figure 1 . For example, the on-chip register file contains the 
most frequently used data items; the next high-speed memory 
is implemented as high-speed instruction and data caches for 
the main memory, which is a “cache” of the mass storage 
system. The R3000 utilizes various techniques to manage the 
interaction of the various levels, from register allocation 
algorithms implemented in the compilers to TAG comparison 
of the cache, and an MMU to support a demand paged virtual 
memory system. 


Highest Bus Bandwidth 
(Native Rate of the CPU) 


t 

Lowest Bandwidth 
(Where the Program Resides) 



On-Chip Register File 
Managed by Compilers, 
Programmer’s 
3 Words/ Clock Cycle 


Instruction and Data Caches 
Managed by 
On-Chip Controller 
2 Words/Clock Cycle 


Main Memory 

Capabilities are system dependent. 
Long Latency, but Burst Bandwidth 
of 1 Word/Clock Cycle 


Mass Storage 
Managed by MMU, 
Page Tables 


Figure 1. Hierarchies of Memory in an R3000-Based System 
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The R3000 integrates a direct mapped cache controller on 
chip. The R3000 requires that a full 60 bits of data be provided 
by each cache in each cycle; these 60 bits include 32 bits of 
data, 4 bits of data parity, 20 bits of TAG, a valid bit, and 3 bits 
of TAG parity. This amount of overhead is required because 
the R3000 allows (and assumes) cacheable main memory as 
large as 4 GB, and a cache as small as 4 KB. 

All of these techniques combine to allow the R3000 to bring 
the highest levels of performance to microprocessor-based 
systems. However, these techniques are not necessarily 
appropriate to embedded applications, and thus add complex ity 
and cost to the system design. For example, an embedded 
systemtypically contains all instructions in the system PROMs; 
there is no need to perform demand paging from disks. 
Similarly, embedded systems may not operate in a multi- 
tasking model, but rather run a single executive process. This 
simplifies (or eliminates) the requirement for managing virtual 
memory references. 

In fact, many embedded systems do not wish to deal at all 
with caches. This may be motivated by cost, area, or system 
architectural requirements. For example, a real-time system 
may be vastly complicated by a general purpose cache; the 
system designer can not be guaranteed what is in the cache 
when a given system event occurs, and thus the general 
purpose cache brings no performance gain to the real-time 
system. In fact, in these real-time systems, general purpose 
caches are viewed as “non-deterministic”, and thus contradict 
a basic requirement of real-time applications. 

THE IDT 79R3001 ARCHITECTURE 

From the perspective of a programmer, there is no difference 
between the standard R3000 and the IDT 79R3001. Both 
devices contain the same basic execution core and memory 
management unit, thus eliminating any risk of software 
incompatibility. The R3001 is, therefore, fully software 
compatible with the R3000, at both the kernel and user levels 
of software. Neither user programs nor kernels need to be 
modified to use the R3001 . 

However, the R3001 can present a totally different look to 
the system designer compared to the model assumed by the 
R3000. The R3001 was implemented as a set of design 
changes to the bus-interface of the R3000. These changes 
were selected to maximize the amount of flexibility the system 
designer has when implementing the appropriate memory 
subsystem for his embedded application. Whereas typical 
uni-processorcomputersystems tend to have relatively similar 
memory requirements, embedded applications tend to have 
dramaticallydifferent requirements, depending on the particular 
problem to be solved. 


THE R3001 MEMORY INTERFACE 

While the R3001 does support the hierarchical memory 
structure of the R3000 (at lower system cost) , the RISController 
goes beyond the capabilities of the R3000 by allowing a 
different view of memory. Rather than assuming that the 
highest speed memory is always used as a general purpose 
cache, the R3001 allows the system designer to use this 
memory space in a wide variety of ways. This flexibility is the 
key to the R3001 in embedded systems. 

The R3001 allows the system designer to view the fast 
memory as a synchronous memory space, distinct from 
(ratherthan a cache of) slower memory. This partitioning is at 
the discretion of the system designer; in some systems a 
general purpose cache model might still be used, while in 
other systems a “cacheless” memory system would be 
implemented to assure real-time performance. 

The R3001 directly controls two areas of memory: the 
synchronous memory space, and the asynchronous memory 
space. Synchronous memory is further subdivided into 
separate instruction and data portions, to supply the highest 
possible bandwidth to the processor. Unlike the R3000, it is 
simple for the system designer to implement these memory 
spaces as two different regions of memory, rather than 
implement a hierarchical relationship between them. The 
memory interfaces supported by the R3001 are illustrated in 
Figure 2. 

The R3001 directly controls the synchronous memory 
spaces using a single data bus and single address bus but 
separate sets of control pins (one set for instruction and one 
fordata). The control pins control the external de-multiplexing 
of address (using external transparent latches) and multiplexing 
of data (using the output enable of the memory devices). The 
synchronous interface performs both an instruction and data 
access per clock cycle on alternate phases of the clock. It 
presents an address for the instruction in one phase and 
completes the data transaction in the same phase. In the next 
phase it completes the instruction transaction and initiates a 
new data transaction. The operation of the synchronous 
interface is illustrated in Figure 3. 

In systems which wish to implement a hierarchical 
relationship between the high-speed synchronous memory 
and slower memory, the R3001 simplifies the design of the 
high-speed caches by making the on-chip direct-mapped 
cache controller much more flexible. All the system designer 
needs to do is “widen” the synchronous memory space by 
connecting standard memory devices to the appropriate TAG 
lines of the processor. The CPU will automatically manage the 
cache, detect hits and process misses by accessing the 
asynchronous memory space. 
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Figure 2. Memory Interfaces of IDT’s R3001 
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Figure 3. R3001 Synchronous Memory Interface Operation 


In these hierarchical systems, the R3001 implements 
caches with substantially lower cost than the R3000 does. 
This is because of the flexibility of the synchronous memory 
interface. TheR3001 eliminates TAG parity, and makes data 
parity optional. Additionally, rather than requiring TAGs for a 
model of 4 GB memory and a 4 KB cache, the R3001 only 
requires the amount of TAGs required for that particular 
system. Most embedded systems implement main memory of 
8 MB or less, and build caches of 8 KB or larger. Thus, rather 
than requiring 60 bits per cache to support the R3000 cache 
controller (or 15 high-speed 16Kx4 memories), R3001 based 
systems can be implemented with cache widths of 44 bits or 
less (only 1 1 RAMs per cache). This saves a minimum of 8 
high-speed RAMs per system, as shown in Figure 4. Note, 
however, that disabling the checking of TAG bits does not 
reduce the amount of memory supported by the processor; it 


reduces the amount of memory which may be cached. The full 
4 GB address space is still available (although not all of it is 
cacheable) in the system, simplifying address decoding for 
various types of memory such as memory-mapped I/O devices. 

This same mechanism extends to allow cacheless systems; 
the system designer merely disables all TAG checking. The 
architecture of the processor allows software to separate 
synchronous and asynchronous memory references by using 
virtual addresses that are either “cacheable” (synchronous) or 
“uncacheable” (asynchronous). 

This approach yields direct benefits to real-time systems. 
By eliminating the hierarchical model of memory, the high- 
performance real-time system designer can accurately 
measure and predict critical real-time metrics such as context 
switch time and interrupt latency without worrying about the 
uncertainty associated with general purpose caches. 
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Figure 4. Cost-Reduced R3001 Cached System 


DMA SUPPORT 

Since the R3000 always assumes a cache-based system, 
the only external access to the fast memory supported are 
those required for multi-processor cache coherency. The 
R3000thus incorporates a simple but flexible interface which 
allows an external agent to specify a given cache line, and to 
request that the processor invalidate that line by writing into it. 
System DMA events, such as disk transfers, are assumed to 
only occur in the asynchronous memory space. In this 
memory space, the processor is not a master, but merely 
another requester. 

While this model is appropriate for minicomputer type 
systems, it severely limits the embedded system designer. 
Many embedded systems use different types of specialty 
processors under the control of a single, general purpose 
processor. For example, a threat recognition system might 
contain a DSP subsystem to perform image recognition, and 
use a general purpose processor for prioritization or response 
to threats. Such a system needs a high-bandwidth method of 


communicating between these processors, a natural 
application for DMA transfers. 

The R3001 allows an external master to request 
mastership of the synchronous memory bus and control 
signals, as shown in Figure 5. The DMA controller then can 
quickly transfer data from one processor memory into the 
high-speed R3001 synchronous memory, allowing the 
RISController to process the data (or special instruction 
sequence) rapidly. 

CACHELESS SYSTEM 

A high-performance processor requires a great deal of 
memory bandwidth to keep the execution engine running at 
full speed. Typically, caches are used to supply this bandwidth. 
In many applications, however, performance can be traded for 
cost, and the use of caches can be avoided. Alternately, in 
many other applications, it is feasible to implement a large 
enough fast memory that traditional caching strategies can be 
avoided. 
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Figure 5. R3001 DMA Arbiter Interface 


The system design shown in Figure 6 illustrates a cache- 
less system offering 7 mips of performance (based on real- 
world embedded applications such as page-description 
language interpreters). Rather than using caches, a set of 
ROMs contains the processor instructions, and the processor 
clock rate is slowed to 8 MHz to allow a ROM access every 
clock cycle. In addition to the instruction ROMs in the 
synchronous space, there is a bank of DRAM for main storage 
(in a laser printer or graphics application, this would contain 
the page or screen image and the display lists). Peripheral 
devices, such as communications devices, are also resident 
in the asynchronous memory space. 

All instruction accesses are satisfied by the ROMs, 
providing zero-wait state access to instructions. Since the 
processor clock speed is reduced to 8 MHz, DRAM accesses 
can be satisfied in 3 clock cycles (even if using 1 80 ns DRAMs, 
such as those available in the military market place). Note that 
a small data cache, implemented using just 5 2Kx8 SRAMs at 
90ns (very inexpensive devices) could further increase 
performance by reducing the number of DRAM references 


necessary. Implementing the data cache would merely require 
the inclusion of the address latch and the memory devices. 

A similarsystemcould be implemented usingdense SRAMs 
rather than ROMs, allowing higher speed operation. Such a 
system would probably require the boot ROMs to be resident 
in the asynchronous memory space. This type of system is 
typical of many of the real-time applications of the R300 1 . The 
R3001 makes this possible becauseof its processors ability to 
implement cacheless systems, and because of its support of 
large (16 MB) synchronous memory. 


REAL-TIME CACHE BASED SYSTEM 

It is possible to implement a system with the benefits of 
caching which also satisfies the deterministic requirements of 
the real-time system designer. With the R3001 , the real-time 
system designer can easily implement a specialized caching 
structure which guarantees that critical kernel routines will be 
executed with constant execution time (ratherthan the variable 
execution time which is a result of the uncertainty of whether 
the code is cache resident). 
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Figure 6. R3001 ROM Based, Cacheless System 


The basic technique involves segmenting the cache into 
two portions: a dedicated kerne! section, where the time 
critical code is resident and which tasks can not overwrite, and 
a general purpose section, where the tasks benefit from 
caching but which is non-deterministic. 

Software separates memory areas for the time critical code 
from the general task, according to the addresses used. 
External address decoding is then used to separate kernel 
from general accesses, and select the right cache area 
depending on the decoded address. 


Figure 7 shows the cache subsystem which might be 
implemented for such a design. A high-order address line 
indicates whether the access is for the time critical kernel, or 
is a general access. If a kernel access is indicated, then the 
chip-enable for the kernel cache is activated and the enable 
for the general cache is negated, thus insuring a kernel 
reference. A simple routine at system startup pre-loads the 
critical code into the kernel cache, so that all invocations of 
those routines result in cache “hits”. This achieves the highest 
levels of real-time performance, utilizesthe cachesto minimize 
overall system cost and achieves the highest levels of real- 
time throughput. 
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Figure 7. R3001 Real-Time, Cache Based System 


SUMMARY 

The introduction of the R3001 enables the designers of 
embedded systems to bring the performance inherent in the 
MIPS RISC architecture to their embedded applications by 
satisfying the cost, area, and performance constraints of 
various applications. This allows the designer to use a single 
base architecture throughout their system, for example, by 
using the R3001 to manage I/O processing in an R3000 


based minicomputer. This allows both IDT andourcustomers 
to leverage their experience with the architecture, and has 
proven to be attractive to a large number of our customers. 

IDT will continue to innovate with the MIPS RISC 
architecture, producing other derivative versions of the 
architecture which are well suited to specific embedded 
applications. 


9 


9.17 


9 










DOMESTIC SALES 


ALABAMA 

IDT 

4930 Corporate Dr., Ste. 1 
Huntsville, AL 35805 
(205)721-0211 


ALASKA 

Westerberg & Associates 
Bellevue, WA 
(206) 453-8881 


ARIZONA 

Western High Tech Mktg. 
Scottsdale, AZ 
(602) 860-2702 


ARKANSAS 

IDT 

(S. Central Regional Office) 
14285 Midway Rd., Ste. 100 
Dallas, TX 75244 
(214) 490-6167 


CALIFORNIA 

IDT 

(Corporate Headquarters) 
3236 Scott Blvd. 

P.O. Box 58015 
Santa Clara, CA 
95052-8015 
(408) 727-6116 

IDT 

(Western Headquarters) 
2972 Stender Way 
Santa Clara, CA 95054 
(408) 492-8350 

IDT 

(SW Regional Office) 

6 Jenner Dr., Ste. 100 
Irvine, CA 92718 
(714) 727-4438 

IDT 

(SW Regional Office) 
16130 Ventura Blvd., 

Ste. 370 

Encino, CA 91436 
(818)981-4438 

Quest-Rep 
San Diego, CA 
(619) 565-8797 


CANADA 

(EASTERN) 

Canadian Mktg. Tech. Inc. 
Kanata, ONT 
(613) 591-9555 

Canadian Mktg. Tech. Inc. 
Mississauga, ONT 
(416)612-0900 

Canadian Mktg. Tech. Inc. 
Pointe Claire, QUE 
(514) 694-608 8 

CANADA 

(WESTERN) 

Westerberg & Associates 
Bellevue. WA 
(206) 453-8881 


COLORADO 

IDT 

(NW Regional Office) 
1616 17th St., Ste. 370 
Denver, CO 80202 
(303) 628-5494 

Thorson Rocky Mountain 
Englewood, CO 
(303) 799-3435 


CONNECTICUT 

Lindco Associates 
Woodbury, CT 
(203) 266-0728 


DELAWARE 

IDT 

(NE Regional Office) 
428 Fourth St., Ste. 6 
Annapolis, MD 21403 
(301) 858-5423 


FLORIDA 

IDT 

(SE Regional Office) 

1413 S. Patrick Dr., Ste. 10 
Indian Harbor Beach, FL 
32937 

(407) 773-3412 

IDT 

(SE Regional Office) 

601 Cleveland St., Ste. 400 
Clearwater, FL 34615 
(813)447-2884 


IDT 

(SE Regional Office) 

1500 N.W. 49th St., 

Ste. 500 

Ft. Lauderdale, FL 33309 
(305) 776-5431 


GEORGIA 

IDT 

(SE Regional Office) 

1413 S. Patrick Dr., Ste. 10 
Indian Harbor Beach, FL 
32937 

(407) 773-3412 


HAWAII 

IDT 

(Western Headquarters) 
2972 Stender Way 
Santa Clara, CA 95054 
(408) 492-8350 


IDAHO (NORTHERN) 

Westerberg & Associates 
Bellevue, WA 
(206) 453-8881 


IDAHO (SOUTHERN) 

Westerberg & Associates 
Portland, OR 
(503) 620-1931 


ILLINOIS 

IDT 

(Central Headquarters) 
1375 E. Woodfield Rd., 
Ste. 380 

Schaumburg, IL 60173 
(708)517-1262 

Synmark Sales 
Park Ridge, IL 
(708) 390-9696 


INDIANA 

Arete Sales 
Ft. Wayne, IN 
(219) 423-1478 

Arete Sales 
Greenwood, IN 
(317) 882-4407 


IOWA 

Rep Associates 
Cedar Rapids, IA 
(319) 373-0152 


KANSAS 

Rush & West Associates 
Olathe, KS 
(913) 764-2700 


KENTUCKY 

Norm Case Associates 
Rocky River, OH 
(216)333-0400 


LOUISIANA 

IDT 

(S. Central Regional Office) 
14285 Midway Rd., Ste. 100 
Dallas, TX 75244 
(214) 490-6167 


MAINE 

IDT 

(Eastern Headquarters) 

# 2 Westboro Business Park 
200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


MARYLAND 

IDT 

(NE Regional Office) 
428 Fourth St., Ste. 6 
Annapolis, MD 21403 
(301) 858-5423 


MASSACHUSETTS 

IDT 

(Eastern Headquarters) 
ft 2 Westboro Business Park 
200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


MICHIGAN 

Tritech Sales 
Farmington Hills, Ml 
(313) 442-1200 


MINNESOTA 

OHMS Technology Inc. 
Edina, Ml 
(612)932-2920 


MISSISSIPPI 

IDT 

(SE Regional Office) 

1413 S. Patrick Dr., Ste. 10 
Indian Harbor Beach, FL 
32937 

(407) 773-3412 


MISSOURI 

Rush & West Associates 
St. Louis, MO 
(314)965-3322 


MONTANA 

Thorson Rocky Mountain 
Englewood, CO 
(303) 799-3435 


NEBRASKA 

IDT 

(Central Headquarters) 
1375 E. Woodfield Rd., 
Ste. 380 

Schaumburg, IL 60173 
(708) 517-1262 


NEVADA 

(NORTHERN) 

IDT 

(Western Headquarters) 
2972 Stender Way 
Santa Clara, CA 95054 
(408) 492-8350 


NEVADA 

(SOUTHERN) 

Western High Tech Mktg. 
(Clark County, NV) 
Scottsdale, AZ 
(602) 860-2702 


NEW HAMPSHIRE 

IDT 

(Eastern Headquarters) 

4 2 Westboro Business Park 
200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


NEW JERSEY 

IDT 

(NE Regional Office) 
One Greentree Centre, 
Ste. 202 

Marlton, NJ 08053 
(609) 596-8668 

SJ Associates 
Mt. Laurel, NJ 
(609) 866-1234 


NEW MEXICO 

Western High Tech Mktg. 
Albuquerque, NM 
(505) 884-2256 


NEW YORK 

IDT 

(NE Regional Office) 
250 Mill St., Ste. 107 
Rochester, NY 14614 
(716) 546-4880 


Quality Components 
Buffalo, NY 
(716) 837-5430 

Quality Components 
Manlius, NY 
(315) 682-8885 

SJ Associates 
Rockville Centre, NY 
(516) 536-4242 


NORTH CAROLINA 

Tingen Technical Sales 
Raleigh, NC 
(919)870-6670 


OHIO 

Norm Case Associates 
Rocky River, OH 
(216) 333-0400 


OKLAHOMA 

IDT 

(Central Headquarters) 
1375 E. Woodfield Rd„ 
Ste. 380 

Schaumburg, IL 60173 
(708) 517-1262 


OREGON 

Westerberg & Associates 
Portland, OR 
(503) 620-1931 


PENNSYLVANIA 

(WESTERN) 

Norm Case Associates 
Rocky River, OH 
(216) 333-0400 


PENNSYLVANIA 

(EASTERN) 

SJ Associates 
Rockville Centre, NY 
(516) 536-4242 


RHODE ISLAND 

IDT 

(Eastern Headquarters) 
ft 2 Westboro Business Park 
200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


SOUTH CAROLINA 

IDT 

(SE Regional Office) 

1413 S. Patrick Dr., Ste. 10 
Indian Harbor Beach, FL 
32937 

(407) 773-3412 


TEXAS 

IDT 

(S. Central Regional Office) 
6034 W. Courtyard Dr., 

Ste. 305-48 
Austin, TX 78730 
(512) 338-2440 

IDT 

(S. Central Regional Office) 
14285 Midway Rd„ Ste. 100 
Dallas, TX 75244 
(214)490-6167 


UTAH 

Anderson Associates 
Bountiful, UT 
(801)292-8991 


VERMONT 

IDT 

(Eastern Headquarters) 

42 Westboro Business Park 
200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


VIRGINIA 

IDT 

(NE Regional Office) 
428 Fourth St., Ste. 6 
Annapolis, MD 21403 
(301) 858-5423 


WASHINGTON 

Westerberg & Associates 
Bellevue, WA 
(206) 453-8881 

IDT 

(NW Regional Office) 
7981 168th Ave. N.E., 
Ste. 32 

Redmond, WA 98052 
(206)881-5966 


WEST VIRGINIA 

Norm Case Associates 
Rocky River, OH 
(216) 333-0400 


WISCONSIN 

Synmark Sales 
Park Ridge, IL 
(708) 390-9696 


WYOMING 

Thorson Rocky Mountain 
Englewood, CO 
(303) 799-3435 


IDT TECHNICAL CENTERS 


Integrated Device Technology 

Integrated Device Technology 

INTEGRATED DEVICE TECHNOLOGY, INC. 

(Western Headquarters) 

(South Central Regional Office) 

(European Headquarters/Northern Europe 

2972 Stender Way 

14285 Midway Road, Suite 100 

Regional Office) 

Santa Clara, CA 95054 

Dallas, TX 75244 

21 The Crescent 

(408) 492-8350 

(214)490-6167 

Leatherhead 

Surrey, UK KT228DY 

Tel.: 44-372-377375 

Integrated Device Technology 

Integrated Device Technology 


(Southwestern Regional Office) 

(Eastern Headquarters) 


6 Jenner Drive, Suite 100 

42 Westboro Business Park 


Irvine, CA 92718 

200 Friberg Parkway, Suite 4002 


(714) 727-4438 

Westboro, MA 01581 
(508) 898-9266 



AUTHORIZED DISTRIBUTORS 

HALL-MARK HAMILTON/A VNET INSIGHT VANTAGE ZENTRONICS 

ELECTRONICS ELECTRONICS COMPONENTS 


Contact your local office. 


INTERNATIONAL SALES 


AUSTRALIA 

George Brown Group 
Rydalmere, Australia 
Tel.: 612-638-1999 

George Brown Group 
Hilton, Australia 
Tel.: 618-352-2222 

George Brown Group 
Blackburn, Australia 
Tel.: 613-878-8111 


AUSTRIA 

Ing. Erst. Steiner 
Vienna, Austria 
Tel.: 43-222-827-4740 


BELGIUM 

Betea S.A. 

Sint-Stevens-Wolnne, 

Belgium 

Tel.: 323-736-1080 


DENMARK 

Exatec A/S 

Copenhagen, Denmark 
Tel.: 45-31-191022 


FEDERAL REPUBLIC 
OF GERMANY 

IDT 

(Central Europe Regional 
Office) 

Gottfried-Von-Cramm-Str. 1 
8056 Neufahrn 
Federal Repulic of Germany 
Tel.: 49-8165-5024 

Dacom GmbH 
Stuttgart, FRG 
Tel.: 49-711-780-6810 

Dacom GmbH 
Ismaning, FRG 
Tel.: 49-89-964-880 

Dacom GmbH 
Buxheim, FRG 
Tel.: 49-08-458-4003 

Dacom GmbH 
Soligen, FRG 
Tel.: 49-21-259-3011 

Dacom GmbH 
Karlsruhe, FRG 
Tel.: 49-72-14-7193 


Dacom GmbH 
Sarstedt, FRG 
49-89-5066-5160 

Scantec GmbH 
Planegg, FRG 
Tel.: 49-89-859-8021 

Scantec GmbH 
Kirchheim, FRG 
Tel.: 49-89-70-215-4027 

Scantec GmbH 
Ruckersdorf, FRG 
Tel.: 49-89-91-157-9529 

Topas Electronic GmbH 
Hannover, FRG 
Tel.: 49-51-113-1217 

Topas Electronc GmgH 
Quickborn, FRG 
Tel.: 49-4106-73097 


FINLAND 

Comodo Oy 
Helsinki, Finland 
Tel.: 358-0757-2266 


FRANCE 

IDT 

(Southern Europe Regional 
Office) 

15 Rue du Buis son aux 
Fraises 

91300 Massy, France 
Tel.: 33-1-69-30-89-00 

Scientec REA 
Chatillon, France 
Tel.: 33-149-652750 

Scientec REA 
Cesson-Sevigne, France 
Tel.: 33-99-32-1544 

Scientec REA 
Saint Etienne, France 
Tel.: 33-77-79-7970 

Scientec REA 
Venissieux, France 
Tel.: 33-78-00-0415 

Scientec REA 
Cedex, France 
Tel.: 33-61-39-0989 

A2M 

BUC, France 
Tel.: 33-39-54-9113 


Aquitech 
Merignac, France 
Tel.: 33-56-55-1830 

Aquitech 

Cedex, France 

Tel.: 33-1-40-96-9494 

Aquitech 
Rennes, France 
Tel.: 33-99-78-3132 

Aquitech 

Lyon, France 

Tel.: 33-72-73-2412 


HONG KONG 

IDT 

(Hong Kong Regional Office) 
Unit 329, 3/F Asia Business 
Centre 

The Centre Mark, 

287-299 Queen's Road 

Central 

Hong Kong 

Tel.: 852-542-0067 

Lestina International Ltd. 
Kowloon, Hong Kong 
Tel.: 852-735-1736 


INDIA 

Malhar Corp. 
Rosemont, PA 
Tel.: 215-527-5020 


ISRAEL 

Vectronics, Ltd. 
Herzlia, Israel 
Tel.: 972-52-556070 


ITALY 

Lasi Electronica 
Milano, Italy 
Tel.: 39-66-101370 

Microelit SRL 
Milan, Italy 
Tel.: 39-2-469044 

Microelit SRL 
Rome, Italy 
Tel.: 39-6-8894323 


JAPAN 

IDT 

(Japan Headquarters) 

U.S. Bldg. 201 
1-6-15 Hirakarasho, 
Chiyoda-Ku 
Tokyo 102, Japan 
Tel.: 81-3-221-9821 

Dia Semicon Systems 
Tokyo, Japan 
Tel.: 81-3-439-2700 

Kanematsu Semiconductor 
Corp. 

Tokyo, Japan 
Tel.: 81-3-511-7791 

Marubun 
Tokyo, Japan 
Tel.: 81-3-639-9897 

NKK Corp. 

Tokyo, Japan 
Tel.: 81-3-228-3826 

Tachibana Tectron Co., 

Ltd. 

Tokyo, Japan 
Tel.: 81-3-793-1171 


KOREA 

Eastern Electronics 
Seoul, Korea 
Tel.: 822-566-0514 


NETHERLANDS 

Auriema 

Eindhoven, Netherlands 
Tel.: 31-40-816565 


NORWAY 

Eltron A/S 
Oslo, Norway 
Tel.: 47-2-500650 


SINGAPORE 

Data Source Pte. Ltd. 
Lorong, Singapore 
Tel.: 65-291-8311 


SOUTH AMERICA 

Intectra Inc. 

Mountain View, CA 
Tel.: 415-967-8818 


SPAIN 

Anatronic, S.A. 

Madrid, Spain 
Tel.: 34-154-24455 

Anatronic, S.A. 
Barcelona, Spain 
Tel.: 34-3-258-1906 

SWEDEN 

Svensk Teleindustri AB 
Spanga, Sweden 
Tel.: 46-8-761-7300 


SWITZERLAND 

W. Stolz AG 
Baden-Daettwil, 
Switzerland 
Tel.: 41-56-849000 

W. Stolz AG 
Lausanne, Switzerland 
Tel.: 41-21-274838 


TAIWAN 

Johnson Trading Company 
Taipei, Taiwan 
Tel.: 886-273-31211 

General Industries Inc. 
Taipei, Taiwan 
Tel.: 886-2764-5126 


UNITED KINGDOM 

IDT 

(European Headquarters/ 
Northern Europe Regional 
Office) 

21 The Crescent 
Leatherhead 
Surrey, UK KT228DY 
Tel.: 44-372-363339 

Micro Call, Ltd. 

Thame Oxon, England 
Tel.: 44-84-261-939 




3236 Scott Boulevard 

Santa Clara, CA 95054-3090 

(408) 727-6116 FAX: (408) 492-8674 
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